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Abstract 

It is a commonplace in the philosophy of physics that any local physical theory 
can be represented using arbitrary coordinates, simply by using tensor calculus. 
On the other hand, the physics literature often claims that spinors as such cannot 
be represented in coordinates in a curved space-time. These commonplaces are 
inconsistent. What general covariance means for theories with fermions, such as 
electrons, is thus unclear. In fact both commonplaces are wrong. Though it is 
not widely known, Ogievetsky and Polubarinov constructed spinors in coordinates 
in 1965, enhancing the unity of physics and helping to spawn particle physicists' 
concept of nonlinear group representations. Roughly and locally, these spinors 
resemble the orthonormal basis or "tetrad" formalism in the symmetric gauge, 
but they are conceptually self-sufficient and more economical. The typical tetrad 
formalism is de-Ockhamized, with six extra field components and six compensating 
gauge symmetries to cancel them out. The Ogievetsky- Polubarinov formalism, by 
contrast, is (nearly) Ockhamized, with most of the fluff removed. As developed 
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nonperturbatively by Bilyalov, it admits any coordinates at a point, but "time" 
must be listed first. Here "time" is denned in terms of an eigenvalue problem 
involving the metric components and the matrix diag(— 1, 1, 1, 1), the product of 
which must have no negative eigenvalues in order to yield a real symmetric square 
root that is a function of the metric. Thus even formal general covariance requires 
reconsideration; the atlas of admissible coordinate charts should be sensitive to the 
types and values of the fields involved. 

Apart from coordinate order and the usual spinorial two-valuedness, (densi- 
tized) Ogievetsky-Polubarinov spinors form, with the (conformal part of the) met- 
ric, a nonlinear geometric object, for which important results on Lie and covariant 
differentiation are recalled. Such spinors avoid a spurious absolute object in the 
Anderson-Friedman analysis of substantive general covariance. They also permit 
the gauge-invariant localization of the infinite-component gravitational energy in 
General Relativity. Density-weighted spinors exploit the conformal invariance of 
the massless Dirac equation to show that the volume element is absent. Thus 
instead of an arbitrary nonsingular matrix with 16 components, 6 of which are 
gauged away by a new local 0(1, 3) gauge group and one of which is irrelevant due 
to conformal covariance, one can, and presumably should, use density- weighted 
Ogievetsky-Polubarinov spinors coupled to the 9-component symmetric square root 
of the part of the metric that fixes null cones. Thus ^ of the orthonormal basis is 
eliminated as surplus structure. Greater unity between spinors (related to fermions, 
with half-integral spin) and tensors and the like (related to bosons, with integral 
spin) is achieved, such as regarding conservation laws. 

Regarding the conventionality of simultaneity, an unusually wide range of e 
values is admissible, but some extreme values are inadmissible. Standard simul- 
taneity uniquely makes the spinor transformation law linear and independent of the 
metric, because transformations among the standard Cartesian coordinate systems 
fall within the conformal group, for which the spinor transformation law is linear. 
The surprising mildness of the restrictions on coordinate order as applied to the 
Schwarzschild solution is exhibited. 

Keywords: electron, spinor, fermion, general covariance, geometric object, nonlinear 
group representation, conventionality of simultaneity, Schwarzschild solution 
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1 Introduction 



1.1 A No-Go Theorem from c. 
1960s— But Who Noticed? 



1930 Circumvented in the 



Around the 1930s there arose in the mathematical physics community a no-go theorem 
pertaining to quantum mechanics, to the effect that there cannot be a treatment of 
certain distinctive of features of quantum mechanics in ways familiar from classical 
physics, not even classical relativistic field theory. In the mid-1960s, with some clues 
in the early 1950s, the irrelevance of the theorem was shown by explicit construction 
of a formalism that did basically what the no-go theorem allegedly had shown to be 
impossible; some of the theorem's technical assumptions were seen to be much less 
evident than they had seemed. And yet great masses of literature went on with business 
as usual for decades, as though the progress at mid-century never happened. I do not 
have in mind to discuss the names of Von Neumann Neumann, 1955| as proving a 



no-hidden-variables theorem around 1930, Bohm Bohm, 1952 as showing a way around 
it in the early 1950s, and Bell as providing systematic treatment of the issue in the 
mid-1960s |Bell, 1987 , as the reader might have assumed. I have in mind another 
story involving theorem involving Weyl |Weyl, 1929b[ |Weyl, 1929a[ |Weyl, 1929c 



Fock Fock, 1929 , Infeld and van der Waerden jlnfeld and van der Waerden, 1933 



and Cartan Cartan, 1 966 in the period 1929-37, with Weyl and Cartan stating a 
no-go theorem with special clarity and force; Bryce and Cecile DeWitt as providing 
clues regarding another way in the early 1950s |DeWitt and DeWitt, 19521 ; and sub- 
stantial clarification and resolution in the mid-1960s by V. I. Ogievetskyn and I. V. 
Polubarinov Ogievetskii and Polubarinov, 1965 Ogievetsky and Polubarinov, 1965b 



Ogievetsky and Polubarinov, 1965a Borisov and Ogievetskii, 1974 Bilyalov, 1992 



Schucker, 2000 , Bilyalov, 2002 Iochum and Schiicker, 2006] . Like Bell's, this clarifica- 
tion in the 1960s was not, from a sociological standpoint, by any means an immediate 
success. Unlike the hidden variable issue, the question of the possibility of spinors as 
such in coordinates is still shrouded in confusion and neglect. Resolving that problem 
motivates this paper. 

The no-go theorem in question is to the effect that there are no finite- component 
spinor fields with linear spinorial transformation laws under arbitrary coordinate trans- 



lr This closing vowels of this name are transliterated in different ways. I use a "y" for convenience 
when no single specific paper is in view, while retaining the spelling actually used by each citation in 
the bibliography. 
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formations |Weyl, 1929b[ |Weyl, 1929c[ |Cartan, 1966HGerfand et al., 1963] ; it is also tac 



itly assumed that the spinor transformation law ought not to involve any other fields. 
This no-go theorem, with the tacit qualification made explicit, evidently is true§| That 
the theorem does not have the significance that it seems to have will become evident 
after dropping the tacit exclusion of other fields, hedging on linearity (retaining linear 
dependence on the spinor but admitting nonlinear dependence on the other field, namely, 
the metric), and hedging a little on the arbitrariness of the coordinate transformations. 
At times imprecision and failure to be aware of Ogievetsky-Polubarinov spinors has lead 
to strictly false assertions. More formal mathematical works are more likely to start with 
truth and end with false glosses, rather than starting with falsehood. Weyl's statements 
below mix truth and falsehood. It would be tedious to intervene sentence by sentence 
to separate the true from the false on every occasion, but below I will do that with the 
1958 work of Gel'fand, Minlos and Shapiro |Gel'fand et al., 1963 for illustration. 



The no-go theorem in question is usually taken to show that one cannot have spinors in 
a curved space-time with spinorial behavior under coordinate transformations. Hence one 
defaults (typically) to the orthonormal basis formalism, in which spinors are coordinate 
scalars and behave spinorially with respect to a new local 0(1, 3) group of rotations and 
boosts of the orthonormal basisifl This new gauge group reflects by how much the content 
of an orthonormal basis exceeds that of the metric tensor to which it corresponds. (The 
orthonormal basis is often called a vierbein or tetrad in four dimensions.) Fock points 
out that an orthonormal basis is necessary for his "half-vector" formalism |Fock, 19"29 



Weyl is very perspicuous, not limiting his claim to those formalisms presently known 
while admitting room for future innovation. All possible formalisms for spinors do not fit 
within tensor calculus, or (presumably) any sensible generalization thereof, he evidently 
holds: 

The components of [the tetrad] e(a) in this coordinate system are designated 



2 Perhaps a few other premises too obvious to mention, such as that the spinor not vanish everywhere, 
are also involved. I aim here only for a physical, not mathematical, level of rigor. One theme of this 
work is the sometimes illusory character of the perfection suggested by precise modern mathematics. One 
needs to keep checking the intended domain of phenomena described to ascertain the appropriateness of 
the axioms. 

3 Sometimes one sees formalisms, whether old or new, that profess to do without a tetrad by adding 
even more surplus structure and an even larger gauge group to cancel it out. Belinfante's critique of such 
works, including Schrbdinger's, seems apt |Belinfante, 1940] , It isn't very surprising, or very interesting, 
that one can devise a formalism with additional surplus structure and hide the tetrad in it. For my 
purposes one succeeds in doing without a tetrad only if there is nothing equivalent to a tetrad in the 
formalism; certainly there should not be additional junk. 
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by e p (a). We need such local cartesian [sic] axes e(a) in each point P in 
order to be able to describe the quantity ip by means of its components ipf, 
■02 j tpi j V'Jj f° r the law of transformation of the components ?/> can only be 
given for orthogonal transformations as it corresponds to a representation of 
the orthogonal group which cannot be extended to the group of all linear 
transformations. The tensor calculus is consequently an unusable instrument 
for considerations involving the ip. |Weyl, 1929b| 

In that same year Weyl explained that 

The tensor calculus is not the proper mathematical instrument to use in trans- 
lating the quantum-theoretic equations of the electron over into the general 
theory of relativity. Vectors and terms [tensors?] are so constituted that the 
law which defines the transformation of their components from one Cartesian 
set of axes to another can be extended to the most general linear transforma- 
tion, to an affine set of axes. That is not the case for quantity ip, however; 
this kind of quantity belongs to a representation of the rotation group which 
cannot be extended to the affine group. Consequently we cannot introduce 
components of ip relative to an arbitrary coordinate system in general relativ- 
ity as we can for the electromagnetic potential and field strengths. We must 
rather describe the metric at a point P by local Cartesian axes e(a) instead 
of by the g pq . |Weyl, 1929c 



He has similar remarks in the second edition of the book on group theory and quantum 
mechanics |Weyl, 19311 p. 219]. 



Cartan dramatically ends his book with the following 

THEOREM. With the geometric sense we have given to the word "spinor" 
it is impossible to introduce fields of spinors into the classical Riemannian 
technique; that is, having chosen an arbitrary system of co-ordinates x % for 
the space, it is impossible to represent a spinor by any finite number N 
whatsoever, of components u a such that the u a have covariant derivatives of 
the form 

U a .i = -q-[+ A ««/3, 

where the are determinate functions of x h [Footnote referring to 
Infeld and van der Waerden, 1933] and italics for most of the theorem have 



been suppressed.]. |Cartan, 1966[ p. 151] 
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Such a result seems very decisive. Yet it is ultimately misleading, because key premises 
that do the work of excluding Ogievetsky-Polubarinov spinors (if one may speak prolep- 
tically) are not even stated in the theorem. 

As Scholz notes, the impossibility of having spinors in coordinates and consequent 
necessity of the use of an orthonormal basis was recognized by its inventors as a conceptual 
innovation Sc holz, 2005| . Clearly they did not hedge their bets by presenting their 
results modestly as technical results subject to interpretive qualification by possible future 
technical developments. Had they done so, they would be blameless despite subsequent 
developments. While no one should be faulted (in non-Whiggish history) for failing 
to prophesy the future of mathematical physics, more modest interpretations of results 
do have the virtue of leaving one less exposed to future refutation from unimagined 
quarters. While the theorem that there is no linear spinor transformation involving 
arbitrary coordinates is quite correct, its significance is less than Weyl, Cartan and 
their many followers have thought. On the contrary, the ambition of authors such as 
Schrodinger Schrodinger, 1932] to avoid a tetrad in coupling spinors to gravity were 
belatedly vindicated in an adequate formalism without some other surplus structure, as 
Ogievetsky and Polubarinov point out. 

In fact Weyl's and Cartan's conceptual innovation of the inadequacy of tensor cal- 
culus, founded as it was on incomplete technical results involving an arbitrary tacit 
premise that would be exposed by OP in 1965, was wrong. It is an artifact of the state of 
mathematical physics in the 1930s, as opposed to developments in the 1950s and 1960s in- 
volving George Mackey's induced group representations (though connection between the 
literature involving Ogievetsky-Polubarinov spinors and such pure mathematics was slow 
in coming), nonlinear geometric objects in differential geometry, nonlinear group repre- 
sentations in particle physics, and the like — various more or less independent parallel 
lines of development. Yet somehow the main body of differential geometry has perse- 
vered with the 1930s tetrad technology nonetheless, expressing it in more modern- looking 
form (e.g., Lawson and Michelsohn, 1989| ), in terms of bundles, various words ending in 
"morphism," and the like. It therefore now looks more advanced than most presentations 
of the OP formalism. (Indeed it is more advanced, as far as extant treatment of global 
issues is concerned. It is noteworthy that the OP symmetric square root of the metric 
exists wherever the metric does; it suffers no topological obstruction.) But that is largely 
an illusion created by comparing apples and oranges, which would quickly disappear in 
comparing Ogievetsky and Polubarinov to Weyl and Cartan. The fact remains that the 
orthonormal basis formalism that forms the backbone of much modern global work on 
spinors in differential geometry is built on what was known and not known in the 1930s, 
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some of which became dated in the 1950s and 1960s. The sociological fact of the per- 
severance of obsolete results due to historical accidents is, clearly, not in itself a good 
reason for retaining those results. Of course there could be excellent reasons for working 
the OP formalism using the modern style. This paper is premised partly on the mod- 
est idea that one should be able to walk before trying to run. Whereas tensor calculus 
was used and diffused for decades in component form in many books, the rather more 
demanding formalism of nonlinear geometric objects and the like, including OP spinors, 
has never had a pedagogical presentation. And modern treatments are not especially 
easy to produce or to relate to classical ones |Spivak, 1979 p. v] [Ea rman, 19 89, p. 159]. 



This component treatment should facilitate a modern treatment by making OP spinors 
known and intelligible. Some reasons why a 'coordinate-free' treatment of nonlinear geo- 
metric objects is not as trivial as slapping basis vectors onto the outside of a set of tensor 
components will become evident below. 

Besides false statements about spinors and coordinate generality, one sometimes finds 
discussions that start out true and illuminating, become misleading (such as by being 
true for the wrong reason), and conclude with a false gloss. The 1958 discussion by 
Gel'fand, Minlos and Shapiro is illustrative. In light of Ogievetsky-Polubarinov spinors, 
I will make a running commentary in square brackets in italic font. 

We note that the tensor representation of the Lorentz group. . . may be ex- 
tended to a representation of the entire group of non-degenerate linear trans- 
formations a in four-dimensional space. In order to do this it suffices to 
substitute into this formula the matrix of the linear transformation a. [They 
seem to envision a rigid rather than position- dependent linear transformation, 
but that does not matter for present purposes; we are not yet worried about co- 
variant and Lie derivatives.] Thus, the tensors. . . which we considered above 
only in orthogonal coordinate systems (i.e. those systems which transform 
into one another by Lorentz transformations) may be written down in any 
coordinate system. 

The situation is different for the spinor representations. . . , which are not 
equivalent to tensor representations. . . . These representations of the Lorentz 
group cannot be extend to a representation of the entire group of linear trans- 
formations of four-dimensional space. [This statement is somewhat vague; its 
truth hangs by a thread. As will appear below, it is true for metrics with indef- 
inite signature, not for the reason that they envisioned, but because one cannot 
take a suitable square root of a matrix with negative eigenvalues. It is false for 
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spaces with definite signature, if one introduces the metric (or its conformal 
part) as an additional field on which the spinor transformation may depend, 
and lets the transformation depend nonlinearly on the metric. If one aims to 
"extend" a representation, why not extend it using what is needed to make it 
actually work?] Thus spinors which we defined only in an orthogonal coordi- 
nate system cannot be defined in a natural fashion in an oblique coordinate 
system. [It isn't clear what counts as "natural" here, but oblique coordinate 
systems are indeed admissible, as long as a certain matrix does not acquire 
negative eigenvalues. The resulting transformation law is linear in the spinor 
and nonlinear in the conformal part of the metric] |Gel'fand et al., 1963 p. 
252] 

This work appeared before everything about Ogievetsky-Polubarinov spinors except the 
De Witts' precursor |DeWitt and DeWitt, 1952 . 



One can sometimes detect an undercurrent of dissatisfaction with the standard or- 
thonormal tetrad formalism even among its advocates. Lawson and Michelsohn observe 
that 

the bundle of spinors itself depends in an essential way on the choice of rie- 
mannian [sic] structure on the manifold. 

These observations lead one to suspect that there must exist a local spinor 
calculus, like the tensor calculus, which should be an important component 
of local riemannian [sic] geometry. A satisfactory formalism of this type has 



not yet been developed. |Lawson and Michelsohn, 19 89, p. 5, emphasis in 
the original] 

The dependence on the metric leads one not to be surprised by the way that Ogievetsky- 
Polubarinov spinors form a geometric object (up to a sign) only with the help of the 
metric (or rather, its conformal part, as will appear below). Like many authors, Lawson 
and Michelsohn give no indication of awareness of Ogievetsky-Polubarinov spinors. It 
is not difficult to conjecture that this formalism fills the hole sensed by Lawson and 
Michelsohn. 

Representing spinors with respect to coordinates in a curved space-time, which sup- 
posedly was shown impossible in 1929, is precisely what Ogievetsky and Polubarinov did 
in 1965. The key technical assumption to be rejected by Ogievetsky and Polubarinov 
was that the spinor transformation law should depend only on the spinor, not also on 
the metric (or its conformal part, as it happens) Ogievetskii and Polubarinov, 1965 . It 



is quite evident that Cartan had in mind linear coordinate transformations for the spinor 



by itself |Cartan, 1966 . Once one gives up that assumption, which appears to be in- 
troduced implicitly due to understandable lack of imagination, rather than explicitly for 
any defensible reason, one finds that spinors almost fit do into tensor calculus. In par- 
ticular, they fit well enough that they have spinorial Lie and covariant derivatives due to 
their coordinate transformation properties Ogievetskii and Polubarinov, 1965 . Making 
comparison of their spinor formalism to the 1950s-60s literature on geometric objects, 
including nonlinear geometric objects (in which the components in one coordinate system 
depend nonlinear ly on those in another) |Tashiro, 1950 Nijenhuis, 1952 Tashiro, 1 952 



Yano, 1957 , Aczel and Golab,~1 960, S zybiak, 1963 Szybiak, 1966| , explains various fea- 



tures of the Ogievetsky-Polubarinov (henceforth "OP") formalism. The literature on 
nonlinear geometric objects, being motivated more by a quest for taxonomic complete- 
ness than by recognition of interesting examples, fell into neglect with the modernization 
of geometrical language. Ironically, parallel ideas grew up about the same time among 
particle physicists, partly out of the original OP paper, and have flourished under the 
name of nonlinear group representations, where they appear in current gauge theories of 
gravity. 

OP's constructive proof that finite-component spinors can be represented 
in coordinates in curved space-time has made rather less of an impact 
than one might have expected. One finds to this day many eminent au- 
thors who repeat the Weyl-Cartan claim that such a thing is impossible 



Weinberg, 1972, Deser and Isham, 1976 van Nieuwenhuizen, 1981, Faddeev, 1982 



Lawson and Michelsohn, 1989 



Fronsdal and Heidenreich, 1992 



Kaku, 1993 



Bardeen and Zumino, 1994 



Weinberg, 2000 



Fatibene and Francaviglia, 2003 



Ferraris et al., 2003| . What one evidently never finds is any critique of the OP 



formalism; clearly the same lack of imagination is still doing much of the work as did 
it in the 1930s, albeit less understandably. While the necessity of the tetrad formalism 
is still the dominant view, one cannot speak of any debate, but rather a mere failure 
to engage with the relevant literature by the majority and a surprising lack of polemic 
from those acquainted with OP spinors, like ships passing in the night. But 46 years is 
long enough for the neglect of a constructive refutation of a famous no-go theorem. 

A more correct picture is provided by some workers in supergravity (a field where 
Ogievetsky and Polubarinov are well known Ogievetskii and Polubarinov, 1965 ): 

in fact it is impossible to define a field, transforming linearly under the A 
[coordinate transformation] group, which also transforms as a spinor when the 
A's are restricted to represent global Lorentz transformations. It is possible 
to get around this difficulty by realizing the A transformations nonlinearly, 
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but this is not a convenient solution. Gates et al., 1983, p. 234], emphasis 
in the original. 

The reader interested in practical calculations, on contemplating the OP formalism to 
which Gates et al. allude, might well doubt its convenience. On the other hand, some 
worked examples below indicate that practical calculations actually need not be incon- 
venient, as it turns out, because the significant symmetries present in most problems 
addressed with pencil and paper make the formalism simplify. But more to the point, for 
philosophical analysis, it does not matter whether a formalism is convenient for explicit 
calculations by having a pleasing linear form, or instead takes a more demanding nonlin- 



ear form. The philosopher pays special attention to conceptual problems |Laudan, 1977 
an area where the OP formalism has significant advantages and some surprises. Gates et 
al. continue a bit later: 

To solve these problems we enlarge the gauge group by adjoining to the A 
transformations a group of local Lorentz transformations, and define spinors 
with respect to this group. This is a procedure familiar in treatments of 
nonlinear a models. Nonlinear realizations of a group are replaced by linear 
representations of an enlarged (gauge) group. The nonlinearities reappear 
only when a definite gauge choice is made. Similarly here, by enlarging the 
gauge group, we obtain linear spinor representations. The nonlinear spinor 
representations of the general coordinate group reappear only if we fix a gauge 



for the local Lorentz transformations. Gates et al., 1983 p. 234] 



A bit later a remark relevant to the relationship between the OP formalism and the more 
common tetrad formalism appears: 

It is thus possible to gauge away the antisymmetric-tensor part of the vier- 
bein (although not the scalar part) with a nonderivative transformation. To 
stay in this [symmetric] gauge a local coordinate transformation must be ac- 
companied by a related local Lorentz transformation; the Lorentz parameter 



is determined in terms of the translation parameter. Gates et al., 1983, p. 
235] 

The tetrad formalism provides a back door to the OP formalism, locally, by imposing 
the local Lorentz (that is, 0(1, 3))-fixing condition that the components of the tetrad 
form a symmetric matrix, once an index is moved with the matrix diag(—l, 1, 1, 1) (or 
its opposite). Having spinors in coordinates is not impossible; it's just difficult and 
unfamiliar. 
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Mention of the De Witts as providing a useful clue in the 1950s fore- 
shadows a common theme, namely, ambiguity and implicit self-contradiction. 
Many authors, apparently unaware of the OP formalism in most (not 
all) cases, have imposed the symmetric gauge condition on the tetrad 
jlsham et al., 197T 



Isham et al., 1973 



Deser and van Nieuwenhuizen, 1974 



't Hooft and Veltman, 1974, Borisov and Ogievetskii, 1974 




Cho and Freund, 1975 


Hamamoto, 1978, Hamamoto, 1979, 


Boulware et al., 197£ 


) 


Veltman, 1981 


Ivanov and Niederle, 1982 Alvarez-Gaume and Witten, 1984 


I 


3 assarino, 1984 


Fujikawa et al., 1985, Fujikawa et al., 1985 


Choi et al., 199S 




Aldrovandi et al., 1994 


Lopez-Pinto et al., 1995 Schiicker, 2000 Tresguerres and Mielke, 2000 


Bilyalov, 2002 Sardanashvily, 2002 Obukhov and Pereira, 2003 


Vasiliev, 2003 



Tiemblo and Tresguerres, 2004, Kirsch, 2005 Holstein, 2006 Leclerc, 2006a 



Iochum and Schiicker, 2006, Nibbelink et al., 2007 



Those who were unaware of 



the OP spinor formalism and imposed the symetric gauge on the tetrad thus in effect 
partially reinvented the OP formalism. It is often not difficult to find the same author 
both asserting that having spinors in coordinates is impossible and implicitly showing (in 
the same paper or elsewhere) how to do it — by imposing a symmetric gauge condition on 



the tetrad |DeWitt, 1965[ Deser and van Nieuwenhuizen, 1974 , Boulware et al., 1979 



Deser and Isham, 1976, Isham et al., 1971 Aldrovandi et al., 1994 Leclerc, 2006a 



Leclerc, 2006b| . Bryce DeWitt, having provided (with Cecile) perhaps the first substan- 
tial hint toward nonlinear metric-dependent spinors |DeWitt and DeWitt, 1952 , later 
asserted the necessity of a tetrad |DeWitt, 1965] ; still later he retreated, in view of their 
own earlier work, to the convenience claim later made by Gates et al., that a nonlinear 



spinor transformation law is possible but not as convenient as a tetrad |DeWitt, 1967b 
What is perhaps most remarkable in this spectacle of contradictory claims passing like 
ships in the night is how rarely the contradictory views are simultaneously noticed and 
recognized as contradictory, a phenomenon that this paper aims to overcome. Clearly 
the authors who deny the existence of spinors in coordinates, while in effect showing 
how to have them, typically have not realized that they were sketching a formalism that 
is in principle independent of the tetrad formalism with which they started. In effect 
many authors have largely reinvented the OP coordinate spinor formalism, even while 
denying its possibility. Such a phenomenon cries out for attention from historians and 
philosophers of science. Thus far such attention has not been given, apart from the 



promissory note in Pitts, 2006 , where OP spinors are used to escape a counterexample 
to the Anderson- Friedman absolute objects analysis of substantive general covariance 
Anderson, 1967 Thorne et al., 1973, Lee et al., 1974 Friedman, 1983] . Histories of 
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spinors in General Relativity still read as if the theory took its definitive form around 
1930 |Kichenassamy, 1992 Scholz, 2005 . On a somewhat related note to mine, Ne'eman 
and Sijacki discuss the persistence of erroneous non-existence claims for spinors under 



the double covering of the general linear group Ne'eman and Sijaci, 1997 Sijacki, 1998 



While they are more interested in infinite-component linear spinors than finite-component 
nonlinear (in the metric) OP spinors, they make use of the latter as well. Part of the 
dispute, presumably, is verbal, depending on what counts as coupling directly to the 
metric (whether the concomitant r^ u is 'direct' enough and whether the author even 
knows about it or has entertained the idea that it might be a good object in its own 
right), what counts as a tetrad (whether r^ u , being symmetric and numerically equal to 
a gauge-fixed tetrad almost everywhere on the manifold, is 'just' a tetrad), what counts 
as a representation (vs. representation up to a sign, as one often sees with spinors), etc. 

This paper aims to make OP spinors well known. It, moreover, aims to make them 
intelligible. The causal reader who takes up the original OP papers, for example, might 
well be terminally perplexed; the persistent reader is richly rewarded. Apparently nothing 
has yet been written on OP spinors that is relatively easy to understand, well connected 
to traditional tensor calculus concepts, and largely complete. This paper aims to fill 
that gap at least in outline. The precise details, involving specific coefficients of some 
infinite series, will not be considered, however. Instead some conceptual features that are 
by no means obvious on confronting the explicit mathematical form will be emphasized. 
In the original OP paper, besides the once-familiar imaginary time coordinate x 4 = ict 
that takes away the comforting pairing of contravariant and covariant indices, one finds 
perturbative expansions that particle physicists are more likely to countenance than are 
general relativists (and that implicitly privilege approximately Cartesian coordinates over 
other coordinates), matrices that look vaguely like a Lie derivative multiplied by unfamil- 
iar expressions, a gravitational potential obtained by subtracting a constant numerical 
matrix from the components of the metric tensor (on which more below), and, in gen- 
eral, the appearance of strong coordinate dependence in a way that makes covariance 
anything but manifest. Explanatory commentary is somewhat brief, while there is no 
connection drawn to the nonlinear geometric objects literature, which might have been 
invoked to provide a natural home for OP spinors within ongoing research in differential 
geometry in the mid-1960s. Neither do OP show any interest in coordinate transforma- 
tions beyond infinitesimal order, this being a genuine omission, as opposed to the above 
list of primarily pedagogical challenges. There is little interest in the convergence of the 
series expansions, either (c.f. |DeWitt and DeWitt, 195 2 ] ) , another substantive question 
leaving one to wonder whether, for example, polar coordinates are admissible. (They are, 
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as becomes clear with Bilyalov's generalized eigenvalue formalism. In fact, if the metric's 
signature were positive definite, all coordinates would be admissible, as usual in tensor 
calculus.) But the reader who perseveres and ponders the formalism is rewarded with 
deep conceptual insights as well as interesting open problems. 

One problem left open by OP has been taken up by Ranat Bilyalov, namely, finite 
and even large coordinate transformations [Bilyalov, 1992, Bilyalov, 2002 1. He has shown 
that, while any coordinate transformation is admissible near the identity, not just any is 
admissible in the finite case, because not just any coordinate system is admissible. Recall 
that coordinates are always described as ordered n-tuples. Generally no importance 
is attached to the order in the presence of a curved metric (or even a flat metric in 
non-Cartesian coordinates), but there is an order nonetheless. It turns out in the OP 
formalism as elaborated by Bilyalov, due to nonlinearity and the role of the signature 
matrix diag(—l, 1, 1, 1) that treats the 0th coordinate differently with the minus sign 
(here used as an alternative to x 4 = ict), that not just any coordinate can be first; 
sometimes one needs to swap the coordinates, as will be discussed in detail below. Thus 
electrons, protons and the like care about 'time' in a way that photons and gravitons do 
not. There is a great deal of flexibility about what can be a time coordinate, but not 
total flexibility as one might expect from the precedent of tensor calculus or the usual 
(de-Ockhamized) tetrad spinor formalism. 

A transition from the OP spinor formalism to the tetrad formalism would illustrate 



part of the conventional wisdom often associated with Kretschmann Norton, 1993 



namely, that arbitrary coordinates are admissible in any theory with some ingenuity. 
In this case one can have the time coordinate in what used to be the 'wrong' place by 
introducing 6 extra components to build a tetrad, and letting spinors be spinors with 
respect to a new gauge group unrelated to space-time. The new spinors are coordinate 
scalars and so do not care about the order of the coordinates. 

The other part of the conventional wisdom often linked to the name of Kretschmann 
holds that admissibility of any coordinates into any theory supposedly is trivial; one 
needs only to use tensor calculus (e.g., |Earman, 2006| ). That last claim, that it's easy 
because only tensor calculus is needed, is obviously false if one reflects on spinors while 
thinking about general covariance. Having to introduce a tetrad to fill an inadequacy of 
the tensor calculus, as Weyl urges as a theorem, is hardly an instance of easily solving a 
problem using tensor calculus. But reflection on spinors rarely happens in the context of 
discussions of formal vs. substantive general covariance, largely because particle physics 
is wrongly thought irrelevant to space-time theory and most general relativists do not 
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write a lot about spinorsfl 

What of the claim that the admissibility of any coordinates is trivial? It is not 
perfectly clear whether general covariance is a substantive physical claim about a space- 
time theory, or merely a formal feature that any theory, suitably formulated, can have. 
Thus some authors |Bergmann, 1957 , Anderson, 1967, Friedman, 1983, Stachel, 1993 



Norton, 2003, Earman, 2006, Pitts, 2009] have contemplated criteria for distinguishing 



the substantive general covariance that supposedly distinguishes Einstein's General Rel- 
ativity (GR) from pre-GR theories, from the merely formal general covariance that any 
theory presumably can have. 

Here one needs to consider what price is to be paid for such admissibility of any 
coordinates. One can draw an analogy to internal gauge symmetries, such as in 
electromagnetism. One can install various gauge symmetries in theories that natu- 
rally lack them, simply by introducing gauge compensation fields — cheating, one might 
say. If the gauge compensation fields zag when the physical fields zig, so to speak, 
then one has a theory with gauge freedom that is physically equivalent to the origi- 
nal. The most famous example is the Stueckelberg trick in massive electromagnetism 
Ruegg and Ruiz-Altaba, 2004[ |Pitts, 2009]. Massive electromagnetism, usually associ- 



ated with Proca, adds a term —^A^A^ to the Lagrangian density F 2 for Maxwell's 
electromagnetism, with m being the "photon mass." The mass term breaks the gauge 
symmetry. That gauge freedom can be restored through the introduction of a gauge 
compensation field, a gauge- dependent coordinate scalar. The Stueckelberg mass term 
takes the form — — d fl ip){A IJi — d^ip). A gauge transformation of A^ is compen- 

sated by changing ip\ A^ — > A^ + <9 M x ; ip ~ > i 1 + X- Once again it is possible to admit 
'all' descriptions on equal footing, but the artifice of the gauge compensation field ip is 
palpable. (One notices that ^'s Euler-Lagrange equation is redundant and that one can 
choose the gauge ip = without spoiling locality or Lorentz invariance, for example.) 
Were someone to suggest that the availability of the Stueckelberg trick shows that elec- 
tromagnetic gauge symmetry is trivial, because even Proca's theory can be reformulated 
so as to have such gauge symmetry, the palpable artificiality of the gauge compensation 
field ip shows otherwise. To use such fields is to cheat at gauge invariance; the difference 
between the natural gauge symmetry of Maxwell's electromagnetism and the artificial 
gauge symmetry of massive electromagnetism with Stueckelberg' s trick is evident. One 
might propose that having coordinate or gauge symmetry without cheating via gauge 



4 In view of the fact that in quantum field theory, spinors generally are almost anti-commutative 
rather than almost commutative, ideally one should consider anticommuting 'classical' spinors. But I 
will not worry about that additional wrinkle. 
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compensation fields should be considered the substantive notion of general covariance or 



gauge symmetry Pitts, 2009 



While it is true that one can have general coordinate symmetry or some other gauge 
symmetry into 'any' theory, suitably formulated — I won't attempt to specify just what 
theories are being quantified over, but local (not necessarily relativistic) field theories 
would be a good proposal — the cost of installing it can be high. The theory might want 
to teach a contrary lesson, as in massive Proca electromagnetism; it would be perverse 
to insist on using the Stueckelberg formulation on the basis of a supposed generalized 
Kretschmann lesson about gauge freedom. By contrast, it is not perverse, but quite rea- 
sonable (and common), to employ the Stueckelberg formulation for certain calculations 
in quantum field theory, such as establishing renormalizability of massive electromag- 
netism. High energy physicists have deployed considerable ingenuity in developing gen- 
eral formalisms that install artificial gauge freedom, such as the Batalin-Fradkin-Tyutin 
formalism. The de-Ockhamized formalism can be very useful technically, but is quite 
unhelpful conceptually. 

1.2 Tetrad in Symmetric Gauge as Back Door to OP Spinors 

It is somewhat curious that the widely accepted no-go claim about having spinors see 
a curved metric without a tetrad and the explicit refutation by construction in 1965 
have managed to coexist so peacefully for so long. In fact one can find hints of the OP 
formalism even earlier |DeWitt and DeWitt, 1952 Huggins, 1962 . A number of authors 



have unwittingly largely reinvented the OP formalism by imposing the symmetric gauge 
on the tetrad formalism: e^ A = e aM , with the indices moved with r\. (My notation here 
employs an obvious Greco- Roman alphabet correspondence principle. Some authors who 
impose this gauge condition prefer to say that two types of indices have merged. Often 
the OP formalism requires doing things that standard tensorial notations are designed 
to prevent.) This fixes the local 0(1,3) gauge freedom. 

The usual tetrad formalism involves both coordinate transformations and local 
Lorentz transformations, with the resulting mixed transformation rule 

A ~ dx u B A ' ^ ' 

where J^VbcJd = Vab- J a is an arbitrary position-dependent element of 0(1,3). 

To approach the OP formalism, one needs to choose J B in each coordinate system such 
that the tetrad matrix is symmetric, once the Latin index is moved with diag(—l, 1, 1, 1). 
The resulting transformation rule for the symmetrized tetrad is nonlinear: it has a lin- 
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ear factor because of the vector index of the tetrad, but acquires further dependence on 
the tetrad via J%, which is chosen to yield a symmetric matrix in the given coordinate 
system. Gauge-fixing the local Lorentz freedom in a coordinate- and tetrad-dependent 
way yields a symmetrized tetrad with a nonlinear transformation law, which is basically 
the symmetric square root of the inverse metric r^ v . It is easy to do so explicitly in 
1 + 1 dimensions, where there are no rotations and the boosts all commute; remark- 
ably, it seems to be possible to say something explicit in closed form more generally 
|L6pez-Pinto et al., 1995| . 

One then notices that there is no need to introduce the tetrad in the first place; 
why not simply deal with the symmetric quantity instead? That symmetric entity r^ v 
or r^ v itself represents arbitrary infinitesimal coordinate transformations, nonlinearly, 
with only coordinate indices. At some stage one should consider just how generally the 
symmetric gauge can be reached from the tetrad; the answer will determine the admissible 
coordinates in the OP formalism. 

The fact that an orthonormal basis can be topologically obstructed on manifolds with 
a metric shows that the symmetric square root of the metric is conceptually independent 
of a tetrad. As simple the 2-sphere with positive definite metric makes the 

point, with the hairy ball theorem excluding even one nowhere- vanishing vector field, to 
say nothing of a basis. The symmetric square root of the metric is an analytic function 
of the metric, as shown by the perturbative expansion, and thus exists (patched together 
as needed from chart to chart) globally wherever the metric does. Clearly the symmetric 
square root of the metric is far from being 'just' a gauge- fixed tetrad, because the former 
exists even in cases where a tetrad does not. 



1.3 Generalized Polar Decomposition 

Componentwise, one can obtain the symmetric square root of the metric using a gen- 
eralized polar decomposition of the tetrad, treated as a matrix of components. The 
usual polar decomposition factors a positive definite matrix into an orthogonal (ro- 
tation) factor and a symmetric (shear and expansion) factors. (Clearly the expan- 
sion, which corresponds to the volume element in our applications, goes along for the 
ride and can be separated out using a unimodular matrix and a scalar factor. Below 
it will be noted that it cancels out altogether from the Dirac equation for massless 
spinors.) Isham, Salam and Strathdee have invoked the polar decomposition near the 
identity, but without mathematical control over what happens for large transformations 
|Isham et al., 197T Isham et al., 1973 . Progress in applied mathematics and linear al- 
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gebra has largely filled that hole. 

A generalized polar decomposition makes use of a "signature matrix" like rj = 
diag(-l, 1, 1, 1) |Bolshakov et al., 1997[ |Higham, 2003] . A matrix M is ^-orthogonal iff 



M T r]M = 7]. It is called rj- symmetric if symmetric with index moved by 77. Tweak- 
ing some notation a bit, one has the following theorem applicable to generalized polar 
decomposition of a tetrad into a symmetric square root and a boost-rotation: 

Theorem 5.1. If [tetrad component matrix] E e ji^xn anc j ^j^t^j^ nQ 
eigenvalues on the nonpositive real axis, then E has a unique indefinite polar 
decomposition E = QS, where Q is //-orthogonal and S is ^-symmetric with 



eigenvalues in the open right half-plane. Higham, 2003 p. 513] 



One can find the boost-rotation explicitly: Q = E(r)E T r)E)~2 fixes the local 0(1,3) 
transformation. Having made this decomposition of the tetrad E, the OP formalism 
works with S or R = rjS, throwing away the local 0(1,3) factor, which is pure gauge, 
and keeping the symmetric factor, which is the symmetric square root of the metric, a 
physical quantity equivalent to the metric. 

1.4 Identifying and Eliminating Surplus Structure 

One common theme in modern physics is the value of identifying and eliminating surplus 
structure. One class of examples that has come up in both Newtonian and relativistic con- 
texts is the elimination of a preferred reference frame with no observable consequences. 
Newton's absolute space and the aether rest frame are textbook examples of entities 
that, though perceived by their advocates to have some explanatory advantages, have 
eventually been regarded as superfluous and worthy of elimination. James Anderson's ab- 
solute objects analysis of general covariance, for example, has emphasized the importance 
of eliminating irrelevant fields in order to identity absolute objects, find the symmetry 
group, etc. Anderson, 1967 , Thorne et al., 1973 , Lee et al., 1974 Pitts, 2 006 1. 



This work takes notice of two distinct compatible ways of eliminating surplus structure 
from the typical formulation of the massless Dirac equation in a curved space-time. One 
entity that can be eliminated, as was realized by Haantjes and Schouten in the 1930s 
Schouten and Haantjes, 1936 , Haantjes, 1941| but apparently forgotten afterwards in 



the move to modern notation, is the volume element y/—g. Many authors nowadays 
discuss the covariance of the Dirac operator (the left side of the massless Dirac equation) 
under conformal changes of metric, but (almost?) no one points out that one could, 
using a suitable choice of primitive fields, achieve mvariance, in which J— g disappears 
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altogether from the theory. This difference is important for two reasons. First, it is 
widely believed that a shift from covariance under arbitrary coordinate transformations 
to invariant objects in modern differential geometry was a great leap forward. Why 
introduce surplus structure only to show its irrelevance? The general failure to treat 
the volume element that way implies that either the shift from covariance to invariance 
was not a great leap forward, or that modern formulations of the Dirac equation are 
inferior to some from the 1930s in important respects; either answer suggests a reform 
of current practice. Second, frequently one finds authors drawing conclusions showing 
that they do not infer from covariance under conformal rescaling of the metric that the 
volume element is irrelevant. That is evident whenever someone assigns a special status 
to Killing vector fields (fields such that the Lie derivative of the metric tensor vanishes) 
that is not assigned to conformal vector fields (fields such that the Lie derivative of 
the conformal part of the metric, which precisely fixes the null cone, vanishes) — which 
does happen |Kosmann, 1966a[ Kosmann, 1966b Fatibene et al., 1998 Cotaescu, 2 000 



Fatibene and Francaviglia, 2003 . Given the eliminability of the volume element from 



the massless Dirac equation, one no more needs cooperation from sj—g (as in Killing's 
equation) than one needs cooperation from the aether rest frame in special relativistic 
physics. Suitably purged of irrelevant fields, the massless Dirac equation does not know 
(locally!) whether it is in a flat space-time or a merely conformally flat one. 

There is an interesting difference between the two purgings of surplus structure from 
the massless Dirac equation. Neglecting gravitation, the volume element is not varied in 
the principle of least action. Eliminating it therefore eliminates a non-variational field, 
which thus increases the symmetry group of the theory from the symmetries of the metric 
to the symmetries of the conformal part of the metric. In flat four-dimensional space- 
time, that is an increase from 10 Killing vector fields to 15 conformal Killing vector fields. 
The antisymmetric part of the tetrad, by contrast, serves as a gauge compensation field, 
much like the Stueckelberg scalar ip that installs artificial electromagnetic gauge freedom. 
Eliminating the antisymmetric part of the tetrad (as in the OP formalism) thus removes 
the local Lorentz (0(1,3)) symmetry. Removing yj— g makes a real symmetry manifest, 
while symmetrizing the tetrad removes an artificial symmetry. 



1.5 General Covariance and Coordinate Reordering 

General coordinate transformations contain a certain collection of transformations that 
probably no one has ever regarded as interesting. They got into the formalism by acci- 
dent. Indeed perhaps no one has noticed them prior to Ranat Bilyalov's work on spinors 
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|Bilyalov, 1992 , and few have done so afterwards. That collection involves simply re- 
ordering the coordinates. The usual formalisms require that the coordinates be expressed 
in some definite order, but no significance is attached to that order. For convenience we 
can focus attention on Special Relativity for the moment; similar lessons apply in Gen- 
eral Relativity, in many respects. Coordinates are listed as an ordered quadruple. If one 
calls the original coordinates (t, x, y, z) for example, with t being the standard temporal 
coordinate in an inertial frame and x, y, and z being spatial Cartesian coordinates as the 
notation suggests, then the new coordinates after reordering might be (t, x, z, y). In this 
case the reordering of two spatial coordinates has occurred; this change is so trivial that 
one need not make any change to the metrical matrix diag(—l, 1, 1, 1) (with the speed of 
light set to 1) to accommodate it. 

Slightly less trivial is a reordering that involves the time coordinate: (t, x, y, z) might 
be replaced by (x, t, y, z). Now the matrix diag(—l, 1, 1, 1) is inappropriate; the invariant 
interval in the new coordinates is given by diag(l, —1, 1, 1). In short, one needs tensor 
calculus, or a small fragment of it, namely the part that introduces a metric tensor and 
its coordinate transformation law, in order to permit the coordinate reordering. One does 
not need a connection; the Christoffel symbols in the new coordinates are still because 
the new coordinates are linear functions of the old. One certainly could formulate Special 
Relativity in this fashion. Presumably no one would bother to introduce a metric tensor 
simply for the slight gain in generality of admitting reorderings of the Cartesian coor- 
dinates, however. The indefinite signature of the metric, which distinguishes temporal 
from spatial coordinates, is crucial here; no analogous issue arises for positive definite 
metrics. 

I labor the modest point about coordinate reorderings because the freedom to write 
'time' (suitably generalized) as the second, third or fourth coordinate (in four space-time 
dimensions) is what one gives up in the Ogievetsky-Polubarinov (OP) spinor formal- 
ism Ogievetskii and Polubarinov, 1965 Borisov and Ogievetskii, 1974 Bilyalov, 2002 



It seems to me that no one should miss that rather trivial bit of coordinate freedom. 
So if such freedom is excluded by a procedure that is very beneficial overall, the benefit 
greatly outweighs the cost. If that proposal is accepted, then in the presence of spinors 
one should renounce talking about coordinate transformations involving the general lin- 
ear group GL(4, R), whether rigid or position-dependent, in favor of some collection that 
divides out by the coordinate reorderings involving time. The condition for coordinate 
admissibility, as will appear below, thus depends on the value of the metric components, 
assuming indefinite signature. The result will most likely be a Brandt groupoid, a struc- 
ture that would be a group except that not every pair of elements can be multiplied 
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Hahn, 19781 [Renault, 1980 . Of course the infinitesimal transformations and associated 



Lie algebra will not notice the difference; reordering the coordinates is not a change that 
one builds up from infinitesimal pieces. On the other hand, if one wishes to preserve 
the usual sense of general covariance as admitting arbitrary coordinate transformations, 
then this modest freedom to reorder the coordinates is the increase. It is evident that, 



contrary to the usual practice of admitting all possible coordinates |Wald, 1984J , it is 
fitting to take the atlas of admissible coordinate charts to vary with the types of fields 
present: in the presence of spinors and an indefinite metric, one should throw out coor- 
dinate interchanges involving time, whereas in the absence of spinors or with a positive 
definite metric, such transformations are harmless. 

When one contemplates position-dependent coordinate transformations, as one may 
in SR and must in GR, the requirement to keep 'time' first will be somewhat more de- 
manding, in that charts such that the 'time'-like character perhaps passes back and forth 
between different coordinates will be excluded — though pieces without such fluctuation 
will still be admissible. It follows that one does not know whether a coordinate system 
is physically admissible until the dynamical problem is solved. One can admit all coor- 
dinates provisionally, but some prove a posteriori not to reflect the underlying physics 
adequately. 

That arbitrary coordinates are admissible in GR is not merely conventional wisdom — 
it is one of the theory's most striking and famous features. Yet I argue that this is a 
mistake, albeit a small one, in the presence of spinor fields. Of course fermions such as 
electrons, protons and neutrons are hardly negligible or unusual things, notwithstanding 
their scarcity in philosophers' discussions of space-time theory. So any adequate un- 
derstanding of space-time theory must take spinor fields into account; the unfortunate 
neglect of particle physics has helped to keep this important issue from attracting the 
attention that it deserves. 

To understand the good and the bad considerations involved in the usual and by now 
largely uncontroversial admission of arbitrary coordinates, it is helpful to remember an 
argument that Einstein made in 1916 when the issue was novel: 

The method hitherto employed for laying co-ordinates into the space-time 
continuum in a definite manner [yielding observable time or space intervals] 
thus breaks down, and there seems to be no other way which would allow us 
to adapt systems of co-ordinates to the four- dimensional universe so that we 
might expect from their application a particularly simple formulation of the 
laws of nature. So there is nothing for it but to regard all imaginable systems 
of co-ordinates, on principle, as equally suitable for the description of nature. 
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Einstein, 1923[ p. 117, emphasis added] 



Indeed coordinates with quantitative physical meaning are not available with adequate 
generality in the curved space-times of GR. The next crucial step involves inferring the 
nonexistence of something, given the failure thus far (in 1916) to imagine it. This step 
was eminently reasonable in 1916 and indeed for some time afterward; but clearly the 
argument is defeasible. There could conceivably arise some qualitative physical meaning 
that coordinates retain, or usefully may retain, even in General Relativity. Note also that 
the amount of coordinate freedom, one might say the scope of conventionality, is itself 
conventional: having admitted sufficiently general coordinates to cover curved space- 
times, one decides whether or not to impose some restrictions based on whether simplicity 
can be achieved thereby. Of course in SR the quest for simplicity is naturally realized 
using Cartesian coordinates. In GR there seemed to be no analogous simplification 
available, so admitting arbitrary coordinates was the natural conclusion. However, by 
now, plausibly this impossibility of imagining a formulation of the laws of nature that is 
simpler with somewhat restricted coordinates then with arbitrary coordinates has in fact 
been overcome by mathematical developments after mid-century. 

OP spinors, though not always simple in terms of the process of explicit calculation, 
are simple in the sense of substantially unifying spinors with tensors and the like and in 
the sense of allowing spinors to be treated without introducing six extra field components 
and then six extra symmetries to gauge them away. OP spinors simply exclude surplus 
structure, rather than introducing it and then marking it as surplus with an additional 
gauge group. That, surely, is conceptual progress. Whereas the tetrad spinor formalism 
makes spinors behave spinorially in terms of a non-coordinate index on the tetrad e\ 
(here written as a large Latin index rather than a small Greek index), the OP formalism 
makes spinors behave spinorially with respect to coordinate transformations. 



2 Geometric Objects, Especially Nonlinear Ones 



2.1 Geometric Objects 

The development of tensor calculus led eventually to the concept of a geomet- 
ric object [Nij enhuis, 1952 Schouten, 1954 Aczel and Golab, 1960 |Trautman, 1965 



Anderson, 1967 , Golab, 1974| , which received substantially satisfactory definition dur- 
ing 1934-7 |Schouten and Haantjes, 1937| . The tensor calculus, which Weyl and Cartan 
had proclaimed unable to accommodate spinors, had not reached its natural limits when 
they wrote. The theory of geometric objects, which was largely complete in the 1960s 
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in the linear case, describes tensors and more general geometric objects, including con- 
nections, as well as some little known nonlinear entities, some but not all of which are 
equivalent to linear ones. This theory, which was never well known, has been partially 
forgotten in the wake of the modern style, except for a few papers on "natural bundles," 
such as |Nijenhuis, 1972 Fatibene and Francaviglia, 2003 , which are far from exhaustive 
in applications, so some review will be appropriate. 

The following is a typical definition of a geometric object (neglecting some finer 
distinctions |Nijenhuis, 1952| and qualifications Kucharzewski and Kuczma, 19 64|): it 
is, for each space-time point and each local coordinate system around it, a finite or- 
dered set of components and a transformation law relating components in different co- 
ordinate systems at the same point |Nijenhuis, 1952 Kucharzewski and Kuczma, 1964 



Trautman, 1965| . (In more modern terms, it is a global, as opposed to local, section of an 



appropriate bundle, a fact not always helpful in dealing with empty spaces around fluids 
or elastic solids Pitts, 2006] . One winds up having a four- velocity of nothingness, which 
presumably ought to be or arbitrary if it is defined at all. I won't attempt to modify 
the definition to avoid this problem, however.) For example, a tangent or "contravariant" 
vector field v M has the coordinate transformation law 

for obtaining the components in primed coordinates x M ' in terms of unprimed coordinates 
x v at point p. More generally, tensors have linear homogeneous transformation laws: 
v' ~ v, where the usual flurry of indices is suppressed, partly for generality. A linear 
homogeneous transformation law also exists for tensor densities |Anderson, 19 67], for 
which the transformation law involves some power of the determinant of the matrix 
|p^r. Geometric objects, as defined here, are not intended in this tradition to represent 
everything that one might wish to talk about, but only everything that is 'like a tensor 
field' in some initially somewhat inchoate sense. 

If one wishes to take a covariant derivative, one does so using an affine connection, a 
geometric object with a linear but inhomogeneous (one might say "affine") transformation 
law r" ~ r + O(0), where the term abbreviated as "O(0)" depends on the first and second 
partial derivatives of one coordinate system with respect to another (hence vanishing for 
linear coordinate transformations), but not the components T. These are all the geometric 
objects that are in wide circulation. 

In what follows it will be helpful to introduce some slightly unusual notation. There 
are contexts in which constants or constant matrices play a role in the theory of ge- 
ometric objects, such as for certain inhomogeneous or nonlinear transformation laws. 
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The main quantity that will be needed here is the matrix diag{— 1, 1, 1, 1). (Whether 
there are interesting nonlinear geometric objects that do not involve something play- 
ing an analogous role is unclear.) At times this will be written as rj^ or rj^, if em- 
phasizing its similarity to tensors is useful. The key point to emphasize is that it is 
just a matrix, a collection of numbers, namely 1, 0, and —1; it is not a metric ten- 
sor and has no transformation law, but is always and everywhere the same. One other 
occasions it will be helpful to emphasize how this matrix is unlike a tensor, at which 
point using Latin indices will be helpful: t]mn or i] MN . Especially when there are two 
kinds of indices, such as one finds in the tetrad formalism, writing 7] with Latin indices 
will be helpful. Somewhat awkwardly, the theory of nonlinear geometric objects, linear 
for a subgroup, involves cases where Greek coordinate indices and Latin indices merge 



Ogievetskii and Polubarinov, 1965 Cho and Freund, 1975 . In such contexts I will use 



Greco-Roman alphabetic correspondence, as noted above, such that where there is an 
obvious correspondence between a Greek letter (such as a or /i) and a Latin index (such 
A or M, respectively — not worrying about capitalization), such indices can be added or 
contracted. The depths of geometric object theory require overcoming some of the usual 
conventions of tensor calculus that are intended to assist the expression of coordinate 
covariance in more typical circumstances, in order to express coordinate covariance in 
more unusual circumstances. 

A simple but uncommon sort of geometric object is a tensor or tensor den- 
sity less some constant or constant matrix. For example, 7 Miy =def 9^ ~ 
rjftv, where r\^ v = diag(—l, 1, 1, 1), can be used as the gravitational potential 
Ogievetsky and Polubarinov, 1965b Ogievetskii and Polubarinov, 1965 . One readily 



finds the transformation law 



dx a dx 13 



X 



dx^' dx v 



7+Vo 



dx a dx 13 
1 dx^' dx v ' 



(2) 



Another example, a close relative of which has been used in the Hamiltonian treatment 
of General Relativity |Dirac, 19 59 1, is k =def Vh — 1, where \fh is the spatial volume 
element. The resulting transformation rule is 



k' = k 



dx 



dx' 



+ 



dx 



dx' 



In both examples the point is to subtract away a background or 'vacuum' value of a quan- 
tity, its perturbation from the constant value being of primary interest. One might want 
such an entity because, e.g., nonlinear stability theory involves a choice of variables that 
vanishes at the origin Khazin and Shnol, 199 1| , in order that powers of the variables be 
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good indicators of the magnitude of the terms. For approximately Cartesian coordinates, 
the value of the perturbation behaves intuitively. In all cases it behaves as a geometric 
object, inheriting convenient tensorial Lie and covariant differentiation properties, be- 
cause such differentiations tend to shave off terms of zeroth order in the fields from the 
transformation law |Yano, 1957 . Thus these examples have Lie and covariant derivatives 
that are tensorial (a symmetric covariant tensor and a weight 1 density, respectively). 
This example exhibits a moderately useful geometric object that exploits the features of 
the theory that go beyond the usual examples of tensors, maybe tensor densities, and 
connections. 

If one wishes to view geometric objects as 'machines' that one feeds basis vectors into 



slots |Misner et al., 1973] , then these machines involve adding a numerical piece with no 
slots and a piece with two slots in the case of 7 M!/ ; for k one has to add a numerical piece 
and a piece with four slots for a basis. Densities with integral weights can be regarded 
as having still more slots, as powers of a top form or the like. It appears to be mean- 
ingless to feed in a fractional or irrational number of basis vectors, however. Densities 
with non-integral weights are useful. Bryce DeWitt once found that his quantum gravity 
formalism simplified (in the sense of having only finitely many nonvanishing bare vertex 
functions in the absence of sources) using either {—g)^g^ u — rf v or (—gj^gfiu — f]^ u 
as primitive fields DeWitt, 1967a| . The simplest (and only probably extant) deriva- 
tions of various classical theories of massive gravitons also use densities of irrational 
weights jO gievetsky and Polubarinov, 1965b Pitts and Schieve, 2007 Pitts, 20TT] . Den- 
sities with nonintegral weights, if viewed as machines into which basis vectors are fed to 
give numbers as output, can and presumably must be defined as powers of densities with 
integral weights; one is in effect raising an operator to a power, a procedure that is merely 
formal until one feeds the basis in. That is, one must feed in a basis, and only afterward 
raise the resulting number to a nonintegral power. Such an issue arises in defining "half- 
forms" in geometric quantization Kostant, 1974 Woodhouse, 1992] , a half-form being a 
scalar density of weight |. The virtue of half-forms is that the product of two of them 
gives a covariant integrand suitable for integrating to get a scalar; if one wants to gener- 
alize Hamiltonian methods by giving up a polarization into g's and p's (of weight w and 
1 — w, respectively, in the standard Hamiltonian formalism), then one wants w = |. The 
same problem of treating fractional weights arises when one wants to decompose a geo- 
metric object into its irreducible parts, once a standard exercise with the metric tensor 



Anderson, 1967] , where one finds a scalar density giving volumes and a tensor density 



g^u defining the null cone. It is interesting (and convenient) that the coordinate trans- 
formation laws for densities do not make such distinctions between formal non-integral 
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powers of operators for non-integral weights and more straightforward definitions for 
integral weights that a modern-style definition requires. Such bifurcation stands in con- 
trast to tensors, whether covariant or contravariant, which can be regarded as boldfaced 
entities in their own right (without changing the subject to how they act on an arbi- 
trary bit of surplus structure, surplus structure that might not even be globally defined, 
as nonvanishing vector fields are not on a 2-sphere, for example |Spivak, 1979| , and or- 
thonormal bases are not without Stiefel- Whitney class restrictions |DeWitt et al., 1979| ). 
Thus some of the elegance of the modern style is an artifact of unnatural restrictions on 
the entities employed. If one considers the 'other' kind of tensors — pseudoscalars, axial 
vectors, and the like — then one evidently should regard the 'coordinate- free' entity as an 
operator waiting to eat a basis and say whether it is positively or negatively oriented. 
Axial vectors cannot be ignored in particle physics and play a role in supergravity. 



2.2 Nonlinear Geometric Objects 

Besides linear and affine geometric objects, there are geometric objects with nonlinear 
transformation laws: the new components are nonlinear functions of the old components, 
as well as depending on ^r(p) and the like. Iterating the transformation rule, it is easy to 
see that polynomial laws are not an option, because the highest power will keep rising with 



the iteration. Two remaining possibilities (perhaps among others Aczel and Golab, 1960 



Castillo et al., 2005J ) are fractional linear transformations and infinite series; one could 



perhaps also take the ratio of two infinite series. Consider the symmetric square root 
r^p of the metric, defined implicitly in any coordinate system (or any admissible one, a 
question requiring additional attention below) by 

g^u = r m ?] aP rp V) (3) 

where rf 1 ^ = diag(—l, 1, 1, 1) here is, as always in this work, only a matrix, not a 
metric tensor or field of any sort. This entity exists at least in many coordi- 
nate systems, most obviously in those not terribly far from freely falling Cartesian 
coordinates. If one expresses the metric g^ as some perturbation about the matrix 
Va/3 — diag(—l, 1, 1, 1), then one can use the binomial theorem to find an infinite series 
expression for r m . The result (when the perturbation is small enough for the series to 



converge DeWitt and DeWitt, 1952] ) is Ogievetskii and Polubarinov, 1965 



OO 1| 

V = E * ~\vuM - ~ ■ ■ ■ b«> - fact ° rs ° f 9 

= Viw + 2 (ft* _ Vw) ~ g(^° ~ Vti*)v al3 {gpv - Vpv) + ■■■■ (4) 
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The factorial-like expression |!/(| — k)\ can be cashed out in terms of Gamma functions, 
but need not be because it stands for | • (| 



1) • . . . • (i — k + 1). If one seeks the 



coordinate transformation law for ry^, it follows from the metric transformation law for 
g^v, which is linear and well known, and the definition as applied to both coordinate 
systems: g = rrjr and g' = r'rjr'. There is no prime on 77 in the expression for g', because 
rjAB is the numerical matrix diag(—l, 1, 1, 1). Less schematically, one can write 

dx 13 



dx a 



r ap V pa r a p- 



This result is correct in general, but is still somewhat implicit in that r'^ a does not 
appear alone on the left side. Note that the matrix diag(— 1, 1, 1, 1) puts time first; if 
one tries to order the coordinates otherwise, then either one has to replace the matrix 
diag(—l, 1, 1, 1) with something else (an option that does not seem fruitful or economical), 
or one gets perturbations of magnitude ±2 (perhaps disastrous even outside perturbative 
expansions) for no benefit even when there is no gravitational field present. Note that 
Bilyalov presents a theorem |Bilyalov, 1992| regarding the necessity and sufficiency of 
reordering the coordinates, not simply a plausibility argument for its necessity. The 
coordinate reordering is in effect part of the service rendered by his matrix 
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(5) 



which combines a permutation and a reflection. 

It is evident from the definition of the symmetric square root of the metric that there 
is a transformation rule for it, at least between admissible coordinate systems. That fact 
is illustrated in the 'commutative diagram' (with factors of diag(—l, 1, 1, 1) suppressed): 



9 <- 



tensor 



9 



root 



root 



r <r 



But what can be said, if anything, that is explicit and practical about the transforma- 
tion from r„j, to r', labeled as "?" in the diagram? To get a fully explicit transformation 



rule, at least when the perturbative expansion exists, one can write (schematically) 



g'vv 



dx\ J 

rT]r vv 

OX J ''< 



dx 
dx' 



(6) 
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and use (J^) T fr/r-^rj — I as the perturbation in the binomial series expansion. In detail, 
the transformation rule is 



r 



, , dx a a dx p 

fJ,V 



Thus the transformation rule, at least in the perturbative context, is an infinite series 
in even powers of r. The series expansion chooses the root near the identity, that is, 
near diag{— 1, 1, 1, 1); presumably one wants the 'positive' principal square root in all 
other contexts also. Bilyalov's generalized eigenvector formalism works more generally 
|Bilyalov, 2002] , but still requires that 'time' be listed first among the coordinates; other- 



wise negative eigenvalues can appear, yielding a complex square root of the metric. Note 
that such coordinate restrictions do not arise in the positive definite case considered 
by Gilbert and Murray |Gilbert and Murray, 1991 and by Bourguignon and Gauduchon 



|Bourguignon and Gauduchon, 1992| , which seem to be rather rare examples of the con- 
sideration of the symmetric square root of the metric by mathematicians. 

Viewed as machines with slots, nonlinear geometric objects have the somewhat pe- 
culiar feature of having, one might say, 'internal' slots as well as external ones. The 
symmetric square root of the metric has two indices, but its nonlinear transformation 
rule implies a far more intimate relationship with the basis vectors than one finds for the 
metric tensor, for which the basis vectors are purely external. For the metric one can 
write = g(^ji, ^r) = (gf^r) • 9 • feeding basis vectors into the machine means 
merely placing them on the outside of the tensor and taking the inner product. Thus 
one goes from the invariant tensor machine g to its (coordinate) basis components g^. 

It is comparably easy to reverse the procedure using the dual basis dx^ 1 to get from the 
components to the invariant tensor machine: 

dx tl g lxv dx v = dx»—-g- T^dx u = f-g-f=g. (8) 

One can try to devise an explicit analogous construction for nonlinear geometric 
objects like the square root of the metric r^. The result would seem to be the following: 

r = c r] + ag + c 2 gr]g + c 3 gr]gr]g + ... (9) 

for appropriate coefficients q from the perturbation expansion. Here the 0th term has 
no slots, the first term has two external slots, the second term has two external slots 
and, in addition, two internal slots that must be summed over, the third term has two 
external slots and also four internal slots to be summed over, etc., ad infinitum. Viewed 
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as a machine, r has two external slots and infinitely many internal slots. Unfortunately 
this expression diverges. But one can rewrite the binomial series expansion above a bit 
more abstractly: 

OO 1| 

Once again the invariant machine r, now defined convergently, has two external slots and 
infinitely many internal slots. It is straightforward to provide analogous expressions for 
the square root of the inverse metric g, f for the square root of the conformal part of the 
metric, etc. 

From the component standpoint, by contrast, nonlinear geometric objects are tech- 
nically intricate in comparison to linear ones, but not essentially different or ineffable. 
Are nonlinear geometric objects worth the trouble to define? While they do sometimes 
appear in applications |Ogievetsky and Polubarinov, 1965b] , |Tyutin and Fradkin, 1972 



at least some of these applications involve only "quasilinear" geometric objects — that is, 
nonlinear geometric objects equivalent to linear ones |Aczel and Golab, 1960 p. 79]- 
and so could in principle be carried out using only linear geometric objects. The role 
of nonlinear geometric objects in the most economical treatment of spinors is a much 
better, and indeed compelling, reason to study them. 

Thus r^ v is (almost) a nonlinear geometric object, where the "almost" qualifica- 
tion depends on what coordinate charts one admits. It is, or was, a familiar claim 
in geometric object theory that one must specify the admissible coordinate systems 
Kucharzewski and Kuczma, 1964] . Given such wiggle room in a vague definition, one 



could regard r^ v as a geometric object. But above the more common requirement 
|Nijenhuis, 1952 Trautman, 1965| that every coordinate system receive a set of compo- 
nents was employed. By that standard r^ u is not a geometric object due to the restriction 
on admissible coordinates. While unfamiliar, this feature is not a problem. The custom 
of admitting all coordinates is an entrenched habit that makes sense when no motivation 
for a restriction presents itself. But it is not required for any important task in differen- 
tial geometry, such as covering a manifold, defining Lie differentiation, or whatever else 
there might be. The whole of classical differential geometry with a Lorentzian metric 
would survive perfectly intact if the restriction to coordinates such that g^ u r] ua have no 
negative eigenvalues were introduced from the start. 

This phenomenon points to the methodological utility of mathematical vagueness, 
which value is not always appreciated. Some give and take is needed in figuring out what 
definitions are appropriate in view of the objects to which one wishes to apply them. 
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(This point is somewhat akin to taking a Bayesian view of mathematics Corfield, 2001| or 
crafting a human-faced Bayesianism |Hacking, 1967| .) Imagine a rigorous mathematician 
with precise definitions and deductively valid arguments, who requires geometric objects 
to admit all coordinates (as usual). The mathematician then proves, let us suppose, that 
there is no symmetric square root of the metric that is a geometric object. Such a result 
is, strictly speaking, true; it also might seem rather important, much as the Weyl-Cartan 
theorem seemed quite important. But the sensible conclusion to draw is that geometric 
objects as defined are not quite the right tool for the job. The problem is not that r^ u is 
defective, but that the definition of geometric object was wrong, that is, inappropriate 
for the intended objects of study. There isn't anything important (other than admitting 
arbitrary coordinates, the importance of which is at issue) that one could do with a 
geometric object that one cannot do with r^, which has a Lie derivative in virtue of its 
transformation properties near the identity. But if one is overly impressed with precision 
and deductive rigor and employs them in a context where they turn out to be premature, 
one might charge ahead with the usual definition and fail to notice that the theorem 
holds due to a minor technical point irrelevant to the substance of classical differential 
geometry. Schouten, coinventor of the affine connection, held that most mathematicians 
(except a few "children of the gods" ) make poor physicists, because they pay too much 
attention to beautiful constructions and too little attention to experiments. He claimed 
not to have "tried to do any physics himself" regarding the problem at hand, involving 
conformal transformations, "[bjecause he is convinced that this is a job only physicists 
can do properly." Schouten, 194 9] While his point is not exactly mine, clearly Schouten 
would have recognized that it would be a mistake for physicists and philosophers to 
think that progress in mathematics was automatically progress in physics and philosophy, 
because mathematicians have different aims and criteria. Classical component-based 
differential geometry has some useful vagueness here. Imposing a restriction on admissible 
coordinates in a classical context is a much smaller change, and more obviously irrelevant 
to every result in classical differential geometry, than is the corresponding modern task. 

Of course there is a role for mathematical rigor; sometimes one really needs to know 
which results are proven theorems. There are also occasions where one needs to treat 
problems globally, at least in General Relativity, on which occasions introducing local 
coordinates is clearly unhelpful. But the principle of being able to walk before running 
applies: one needs to have the local problem under control before tackling the global 
problem. Surprisingly enough, the local spinor problem was not understood properly 
(without a lot of surplus structure and a gratuitous gauge group) by anyone (except 
maybe the precursors noted above) prior to 1965, notwithstanding authoritative state- 
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ments from Weyl, Cartan, inter alia, and remains so even for most authors today. Thus 
the application of precision and deductive rigor turns out to have been premature. 



2.3 Are Coordinates Surplus Structure? 

The reader might have noticed a contrast between some attitudes suggested here and the 
common view that coordinates are an annoying piece of surplus structure now already 
successfully overcome and eliminated by modern differential geometry. This modern 
view is easier to hold if one follows the common practice of failing to study nonlinear 
geometric objects. But coordinates are no panacea; the rise of modern techniques was 
not a fall from classical paradise. Coordinates are an occasionally tedious but quite 
flexible bookkeeping device. Of course there is no point in introducing coordinate basis 
components in contexts that are already handled as well or better without them. Global 
results for topologically nontrivial manifolds will be difficult or impossible to come by 
without modern 'coordinate- free' techniques. Nontrivial bundles over topologically trivial 
manifolds, which might arise even in alternative gravitational theories where space-time 
is globally quite boring, also benefit. 

Whether one ought to avoid coordinates and when depends on whether one can do 
without them everything important that one can do with them, whether doing without 
them requires introducing an alternative irritant that might be worse, and whether co- 
ordinates can be introduced or are already present and hence unavoidable at some level. 
In the present state of mathematics, at least, the following statements appear to be true. 
First, coordinates are already present in the usual definition of a manifold and thus appar- 
ently are unavoidable, hence not yet shown to be surplus structure. (It seems unhelpful 
to avoid talking about coordinates by refusing to assign meaning to a word — "manifold," 
"smooth," or the like — that traditionally has a precise and important meaning.) Second, 
refusing to talk about coordinates either blocks the introduction of a Lie derivative or re- 
quires active diffeomorphisms. Third, active diffeomorphisms presuppose a substantival 
notion of manifold point individuation at odds with that required to free General Relativ- 
ity from indeterminism along the lines of Einstein's relational-flavored point-coincidence 
argument as a response to the hole argument (see, e.g., |Norton, 1993| ). If coordinates 
cannot be avoided without giving up manifolds, if Lie differentiation requires mention 
of either coordinates or active diffeomorphisms, and if active diffeomorphisms require an 
un-General Relativistic notion of point identity that motivates indeterminism, then it is 
by no means yet clear that one always ought to try to avoid coordinates. 

Defining Lie derivatives requires that something move — either coordinates move rel- 
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ative to space-time points (passive coordinate transformations), or fields move relative 
to space-time points (active diffeomorphisms) . Something needs to change in order to 
represent the group (oid) of (more or less) arbitrary transformations. If nothing changes, 
then a fortiori nothing changes like a vector, or a 3-form, or the like. In a few simple 
cases one might hit upon the Lie derivative even without any unifying principle — e.g., the 
commutator of two vectors; but neither the full meaning of the construction nor anything 
like adequate generality would be realized. One sometimes sees treatments of differential 
geometry that deal with invariant bold-faced objects (largely avoiding coordinates), mod- 
eled after vector calculus, and also avoid active mappings [Young, 1993 Simmonds, 1994 



In such work covariant differentiation appears to arise due to ordinary partial differen- 
tiation of an invariant entity built using both components and coordinate basis vectors; 
differentiation of the coordinate basis vectors is supposed to give rise to the Christoffel 
symbol terms. That seems very satisfactory. But that cannot be what is really hap- 
pening, because the connection is an independent field; evidently one already knows in 
advance what the basis vectors are, namely, coordinate gradients of the position vector 
x, which involves reference to a flat metric. Of course x does not exist relative to a 
curved metric, so the technique is rather restricted in application. If neither the coor- 
dinates nor the fields move, there evidently is no concept of a Lie derivative, and hence 
no notion of a Killing vector field or the like, and no adequate way to treat space-time 
transformations (on curved manifolds), which are associated with conserved quantities 
Bergmann, 1958| . That being unacceptable, one needs for something to move, either 



coordinates in a passive transformation of labels, or fields in an active diffeomorphism. 

If one treats OP spinors using active diffeomorphisms, one will encounter the analog 
of the coordinate restriction to keep the components of g^vrf 01 from having negative 
eigenvalues. Thus arbitrary diffeomorphisms, though customary, are excluded. One could 
learn to live with that change. But parts of modern mathematical literature would need to 
be reworked to avoid error or irrelevance. By contrast, in classical coordinate techniques, 
it is obvious that complete generality of coordinates, as opposed to arbitrariness near the 
identity, for example, does little or no work. 

Active diffeomorphisms presuppose a mathematical metaphysics of point individua- 
tion of a rather absolutist or substantivalist sort. Mathematical points have haecceities, 
primitive identities not tied to what happens at the points. One can mail the fields else- 
where while leaving the points behind; the points don't go with the fields. That is almost 
precisely the denial of the point of Einstein's point-coincidence argument, which played 
a crucial role in rescuing generally covariant field equations from indeterminism. Thus 
one is left to worry about indeterminism again, as in the 1980s resurfacing of Einstein's 
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hole argument Earman and Norton, 1987 . While various strategies for dealing with the 
hole argument have been deployed, a natural and largely satisfactory approach is simply 
to refuse to build mathematics on a metaphysical foundation at odds with that of our 
best current theory of gravity, General Relativity. General Relativity being the spur that 
made differential geometry so dominant a part of 20th century mathematics, this tacit 
repudiation of its lessons regarding point individuation is ironic, driving a wedge between 
mathematical points and physical space-time points. 

Mathematicians as such need not worry much about the plausibility of their man- 
ifold point individuation assumptions. There are plenty of mathematically interesting 
manifolds that are not even intended, by virtue of dimension, metric signature, lack of 
a metric, or something else, to pertain to space-time. Perhaps only if mathematicians 
wish for their manifolds to pertain to space-time does the metaphysics of manifold point 
individuation rise above the level of convention. At that point they become in effect 
physicists and/or philosophers and new criteria emerge. If one can find a treatment that 
avoids the surplus structure of a misleading metaphysics of points and that allows one 
to express and calculate everything that one needs without using coordinates, then the 
promise of modern differential geometry for physicists and philosophers will be achieved. 

2.4 Conformal Group Yields Linear Transformation Law 

Besides the full nonlinear transformation law for (nearly) general coordinate transfor- 
mations, it is of interest to ascertain under what circumstances the transformation law 
is linear — that is, to find out what is the stabilizer group. The answer turns out to 
be the 15-parameter ( ( n+1 K"+ 2 ) m n dimensions) conformal group, as will now appear. 
That group contains the Poincare group of special relativistic field theory, a fact that 
explains why one only sees a linear transformation law in special relativistic field theory 
in Cartesian coordinates. Besides the very familiar four (n) rigid space-time translations 
and six ( n ^ n ~^ ) familiar Lorentz transformations (three ( n2 ~^ n+2 ) rotations and three 
in — 1) boosts), the conformal group Fulton et al., 1962] involves a scale transformation 



x^' = sx^ (where s is a real number, nonzero and positive without loss of generality) and 
four (n) "acceleration" transformations 

^ = x_±a 

1 + 2a^ri^x u + a 2 x 2 v ' 

where indices are moved with the matrix diag{— 1, 1, 1, 1) and 

x >i r] iW x u . (In n dimensions, one can append or remove l's and/or — l's along the 
diagonal as needed.) These groups are formal in the sense that, whether or 
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not the initial coordinates are approximately Cartesian, the same functional form 
of the transformation is used. Making contact with particle physicists' work on 
nonlinear group representations, which grew partly out of the OP spinor formal- 



ism (see also Isham et al., 1971 Borisov and Ogievetskii, 1974 Cho and Freund, 1975 



Miglietta, 1979, Ne'eman and Sijaci, 1979 ), one can call the conformal group the stabil- 
ity group for the nonlinear representation. The symmetric square root of the metric is 
thus a nonlinear representation of the general coordinate transformation group (dividing 
out the coordinate reorderings involving time), with the conformal group as its stabi- 
lizer. This example of a nonlinear geometric object (apart from coordinate restrictions) 
is equivalent to a linear geometric object, namely, the metric (or, for that matter, its 
inverse, or various densitized metrics) and so is quasilinear. Quasilinear geometric ob- 
jects have the advantage of providing a bridge between linear cases where everything is 
familiar, such as how to take a Lie or (when defined) covariant derivative, and nonlinear 
cases where much is different. 

It is plausible by inspection, and rigorously true, that the conformal group is exactly 
the group that leaves invariant the conformal metric density fj^ (which determines the 
null cones) , if one pretends to have a conformally flat null cone structure in the theory. 
(There is no such structure here, only the matrix diag(—l, 1, . . . , 1), but a useful trick 
results nonetheless.) This fact, along with strategic insertion of identities in the nonlinear 
transformation law, will yield the linear transformation law for the conformal group. 
From above, the nonlinear transformation rule is 
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(11) 



The quantity in parentheses, as a set of numbers, is formally equal to the components 
of a flat inverse conformal metric density f) pu , a tensor density of weight ■= (or more 
generally, -), under a transformation from Cartesian coordinates to some new coordi- 
nates. (This is just a formal mathematical trick, because only the numerical matrix 



33 



diag(—l, 1, 1, 1) is present!) But in the special case where the transformation is in the 
(15. or KhlKli±g) -parameter) group of conformal transformations, the parenthetic quan- 
tity simplifies dramatically: 
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(12) 



(with no primes because conformal transformations are symmetries of the entity in ques- 
tion), where i] Xt is just the matrix diag(—l, 1, . . . , 1) as usual. Installing this simplification 
into the transformation rule, one encounters the square root of a perfect square: 
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(13) 



the symmetric square root of the metric transforms as a tensor density of a certain weight 

under the conformal group, and hence as a tensor under the Poincare group. (Note that 

for spherical polar coordinates, for example, the perturbative expansion fails, but the 

transformation nonetheless makes sense, as one sees easily by inspection.) 

The transformation rule for the symmetric square root of the conformal metric density 
i_ 

f'fiu = r iiv\f--g n is similar, but with modest changes to ensure preservation of the anti- 
unit determinant condition jf^j = — 1. For general coordinate transformations one has 
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For the conformal group one has the simplification 
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a tensor density of weight — - under the conformal group; this is just the weight required 
to preserve the anti-unit determinant condition [r^l = —1. One notices that f MV , or 
rather its inverse f^, is the quantity to which spinors most readily couple, once one has 
pared away the fat of the asymmetric part of the tetrad and the volume element. They 
are usually retained as though they couldn't be eliminated, giving 16 components (n 2 ) 
where only 9 components ( n2+ 2 n ~ 2 ) are needed, an excess of 7 pieces of surplus struc- 

-n+2 



ture in 4 dimensions or 



extra pieces in general. Evidently modern aspirations 



to avoid surplus structure in differential geometry have not been fulfilled in the more 
common spinor formalism. Ogievetsky and Polubarinov noted only the linearity under 



the Poincare group Ogievetskii and Polubarinov, 1965 , not the full 15-parameter con- 



formal group, but in Ogievetsky's later work the conformal group increasingly appeared 



Ogievetsky, 1973 Borisov and Ogievetskii, 1974 . 



2.5 Linear Spinor Transformation for Conformal Group 

Above it has been shown that the symmetric square root of the (or its inverse) metric 
has a transformation law that, for the special case of the conformal group, is linear. 
The square root of the metric is interesting, for our purposes, not primarily as an end 
in itself; after all, it is equivalent as a geometric object (apart from the question of 
admissible coordinate systems) to the metric, that is, quasilinear in the sense of Aczel 
and Golab Aczel and Golab,T 960|. (It also appears in various nonlinear gauge theories 
|L6pez-Pinto et al., 1995 , Kirsch, 2005 Ali and Capozziello, 2007| .) Rather, the square 
root of the metric is useful for coupling spinors to a curved metric; at that point one 
leaves the realm of quasilinearity for an essentially nonlinear geometric object, or at any 
rate an entity with an essentially nonlinear transformation law (along with coordinate 
restrictions and spinorial two-valuedness) . 
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That the spinorial coordinate transformation rule is linear for the 15-parameter con- 
formal group can be seen in two ways. One way, suitable for infinitesimal transformations, 
is to use the explicit form of the Lie derivative given by Ogievetsky and Polubarinov 
Ogievetskii and Polubarinov, 1965 . This expression involves, as one would expect, a 



transport term involving the generating vector field and the coordinate gradient of the 
spinor, a term that specifies a Lorentz transformation out of the curl (the antisymmetric 
part of the gradient) of the generating vector field, and a novel correction term involving 
the symmetrized gradient of the generating vector field, which term depends nonlinearly 
on the symmetric square root of the metric. While initially forbidding, this novel term, 
on continued contemplation, gives up some of its secrets. The new term, which is what 
makes it possible to take the Lie derivative of a spinor field (which, one recalls, is widely 
believed to be impossible), involves a commutator of the square root of the metric with 
the symmetrized coordinate gradient of the generating vector field. Lie differentiation for 
spinors with respect to an arbitrary vector field is not impossible, nor even fundamen- 
tally different from Lie differentiation for (nonlinear) geometric objects |Yano, 1957 ; it 
is merely difficult and unfamiliar. On account of the commutator, the new term vanishes 
if the symmetrized gradient is proportional to the identity matrix, or, giving up their use 
of an imaginary time coordinate, proportional to r] MN . But that condition is simply the 
conformal Killing equation, if one treats the matrix Tj MN formally as an inverse metric. 
Hence the conformal Killing vector fields of the 'metric' rj MN (not g^ u ) annihilate the 
correction term. 

If one wishes to give the spinor density weight w (| being the proper weight 
to achieve a conformally invariant Dirac operator in four space-time dimensions), 
then an extra term +wip^^ appears, as is usual for Lie derivatives of densities 



Schouten, 1954[ Ogievetskii and Polubarinov, 1965 | Anderson, 1967[ |Israel, 1979|; lin- 



earity is not threatened. 

For finite conformal transformations (even those not connected to the identity), in 
the sense of coordinate transformations such that there exists some (positive) function 
F such that 
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7} 



MN 



dx a dx? 

the transformation law is still linear. One can factor out the determinant to get 



dx a 



dx' 



dx 



V 
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dx 



n dx" 
dx 13 
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MN 



(16) 



(17) 



dx^_ 

dx a 



dx' 
dx 



" is therefore a Lorentz transformation (possibly varying from point to point). 
The whole conformal transformation is therefore the product of a local stretching and a 
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local Lorentz transformation. Making such a transformation on a tetrad thus stretches 
and Lorentz-transforms the world (coordinate, Greek) index in accord with the vector 
(or covector) coordinate transformation law. Assuming that the tetrad was initially 
symmetric (so as to give a transformation of the square root of the metric under coor- 
dinate transformations), one can recover symmetry by making the same local Lorentz 
transformation on the local Lorentz (Latin) index, so that both indices receive the same 
treatment. It does not matter what the value of the square root of the metric is; only its 
being symmetric matters. The local Lorentz transformation acting on the Latin index is 
thus determined wholly by the coordinate transformation, not by the (conformal part of 
the) metric. Thus nonlinearity is averted for the transformation of the symmetric square 
root of the metric for these conformal coordinate transformations. 

Whereas the spinors accompanying a tetrad are coordinate scalars and are spinors 
under (the double cover of) a distinct local 0(1, 3) group, Ogievetsky-Polubarinov spinors 
have a distinctive spinorial coordinate transformation law that reduces to the usual one in 
particle physics in appropriate special cases. One can derive the Ogievetsky-Polubarinov 
spinor transformation law by fixing the local 0(1,3) gauge freedom (given a coordinate 
system) to make the tetrad components symmetric (after an index is moved with t)mn)', 
the corresponding spinor transformation for that element of the local 0(1,3) group is 
then applied to the spinor, giving a coordinate- and metric-dependent boost-rotation to 
the spinor as its coordinate transformation law. For the special case of the conformal 
group of coordinate transformations, because the need to preserve the symmetry of the 
tetrad (that is, the square root of the metric) determines the local Lorentz transformation 
that determines (up to a sign) what happens to the spinor field, it follows that the spinor 
transformation is also independent of the metric, and is merely linear in the spinor. That 
is just what one expects from experience with spinors in Cartesian coordinates in Special 



Relativity, where one makes only Poincare transformations Kaku, 1993 , a subgroup of 
this conformal group. 

2.6 Necessity and Sufficiency of Coordinate Reordering: Ex- 
plicit Treatment in Two Dimensions 

One powerful way to take the symmetric square root of the metric is to use a generalized 
eigenvalue formalism, where one has both the metric g^ v and a Lorentzian matrix 7]^ v = 
diag(— 1, 1, 1, 1) |Bilyalov, 1992 Pitts and Schieve, 20 04 1. It is convenient, though not 



necessary, to factor out the volume element g =def det^^) by working with the conformal 
metric density g^ u , which has determinant —1. Due to the indefiniteness of both g^ v and 
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rj^, the usual theorems about a complete set of real eigenvalues with orthogonal and 
complete eigenvalues do not apply in general. 

There are several things that can go 'wrong' in this generalized eigenvalue for- 
malism, compared to the familiar case where one of the matrices is positive def- 
inite or one takes the ordinary eigenvalues of a symmetric matrix. One is that 
the eigenvalues need not be real. Another is that there need not be a complete 
set of eigenvectors due to nontrivial Jordan blocks. In four dimensions, one can 
have four real eigenvalues with four independent eigenvectors, or four real eigen- 
vectors with three independent eigenvectors, or four real eigenvalues with two inde- 
pendent eigenvectors, or two real and two complex conjugate eigenvalues with two 
real and two complex conjugate eigenvectors |Petrov, 1969 , Hall, 1984b Hall, 1984a 



Hall and Negm, 1986 Hall et al., 1996 Bilyalov, 1992 Kopczyhski and Trautman, 1992 



Bilyalov, 2002, Pitts and Schieve, 2004| . It turns out that neither complex eigenvalues 



nor nontrivial Jordan blocks with the associated shortage of eigenvectors cause any prob- 
lem. The only problem arises if some eigenvalue(s) is negative, because then the square 
root is not real. (Given the metric signatures, there will be either or 2 negative eigen- 
values.) Negative real parts for complex eigenvalues do not pose a problem; one can take 
their square roots to lie in the right half plane and have a positive real sum. To find the 
symmetric square root of the metric, one next finds the eigenvectors, which come as a 
pair of complex conjugates if the eigenvalues do (along with the real pair from the two 
suppressed dimensions). One can find the square root of the metric by taking the square 
root of the eigenvalues and choosing the roots to lie in the right half-plane so that the 
square roots are themselves complex conjugates, and thus obtain a real symmetric square 



root Higham, 1987, Bilyalov, 1992, Higham, 1997, Bilyalov, 2002 Higham et al., 2005 



As Higham succinctly puts the matter, 'Any matrix with no nonpositive real eigenvalues 
has a unique square root for which every eigenvalue lies in the open right half-plane." 



Higham, 1997| The matrix for which one wants a square root is g^r] 1 "* or the like, in 
each coordinate system. It is also connected to the identity t/mn, that is, the 'positive' 
or principal square root. Having no negative eigenvalues is not merely sufficient, but also 
necessary for a real square root determined by the metric: "Let A e M. nxn be nonsingular. 
If A has a real negative eigenvalue, then A has no real square roots which are functions of 



A." | Hi gham, 19 87| A square root that is not a function of the original matrix (a tetrad, 



for example is such a square root of the metric) contains surplus structure. It appears 
not to be possible to let the square root of the metric go complex (in fact, imaginary) 
while maintaining useful degrees of differentiability. 

In two dimensions one readily solves for the eigenvalues using the quadratic formula. 
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Consider the eigenvalue problem 



g 00 - At? 00 g 01 
gOi g ii _ Xr] u 



-A 2 + v,^A 



Here r]^ = diag(—l, 1). One readily finds that 



A _ ± y^ggg - 4 (ig) 

One readily sees that if ly^g^l < 2, then the eigenvalues and hence the eigenvectors are 
complex, giving a real square root of the (conformal part of the) metric. If rj^g' 11 ' > 2, 
then the eigenvalues are positive and again the square root is real. But if rj flu g fJ ' l/ < 
—2, then the eigenvalues are negative and the square root is not real, which is bad. 
One therefore has, in two space-time dimensions, the following necessary and sufficient 
condition for admissibility of coordinates: 

% „g^ > -2. (20) 

(This condition is equivalent to rj^g^ > —2, as one would hope.) This of course holds 
neighborhood by neighborhood; a coordinate system that violates this condition in only 
some places is admissible in those places where the condition hold. Thus it is not the 
case that all coordinate systems are admissible, and the general linear group GL(2,M) 
is not appropriate: it is not the case for admissible coordinates that the derivative of 
one coordinate system with respect to the other at a point is arbitrary. Because r/^ is 
indefinite, nonlinear group representations have to care about the minus sign, or, one 
might say casually, about which coordinate is time. Thus electrons know what a time 
coordinate is, as do all other spinorial entities. The standards for being time are extremely 
lax, but they are not trivial. 

But one notices that by interchanging the coordinates x° and x l , and thus inter- 
changing g 00 and g 11 , flips the sign of ri liu g^ v and hence makes an inadmissible coordinate 
system admissible. Thus not any (ordered) coordinate system is admissible, but any 
appropriately ordered coordinate system is admissible. Such coordinate swapping, one 
recalls, is part of the service performed by Bilyalov's T matrix |Bilyalov, 1992 and its 



relatives. In higher dimensions this sort of simple and direct argument has not been given 
and is not trivial. Bilyalov's proof |Bilyalov, 1992] takes a somewhat different form. 

Obtaining negative eigenvalues, then, is simply a sign that one has proposed as a 
time coordinate what is really (from the perspective of nonlinear geometric objects linear 
for the conformal subgroup) a spatial coordinate. Merely reordering the coordinates 
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at the point solves the problem. If the coordinate transformation is affine, so that its 
gradient is constant, then this move at a point in fact suffices globally. If the coordinate 
transformation is not affine, then reordering the coordinates at a point will tend in some 
cases to reduce the size of admissible coordinate charts, because a coordinate chart that 
has time in the first position in some parts of space-time and not in the first position in 
other parts of space-time will need to have the latter regions truncated. This is really 
not a problem, though it violates the modern habit of admitting all possible coordinate 
systems a priori and never reconsidering them later, a habit that was not so entrenched 
in the classical literature on geometric objects |Kucharzewski and Kuczma, 1964] . 

One might wonder why this coordinate restriction apparently has gone un- 
noticed, at least prior to Bilyalov's work. The answer, or much of it, is 
that previous writers on nonlinear representations of (a sufficiently large piece of) 
GL(4,M) that are linear for the Lorentz subgroup have been content to work near 



the identity Ogievetskii and Polubarinov, 1965 Isham et al., 1971 Isham et al., 1973 



Borisov and Ogievetskii, 1974 Cho and Freund, 1975 Borisov, 1978 1. This has been ev- 



ident in most cases from their explicitly saying so, their working infinitesimally, or their 
achieving finite transformations by exponentiating an infinitesimal transformation. 

2.7 Differentiation of Nonlinear Geometric Objects 

Nonlinear geometric objects have not been investigated in much detail by differen- 
tial geometers or general relativists. Ironically, particle physicists hit upon the idea 
of nonlinear group representations at about the same time (1960s) that the newer so- 
called coordinate-free notion of differential geometry helped to divert mathematicians' 
interests away from the idea of geometric objects. Some important results were ob- 
tained for Lie and covariant differentiation of nonlinear geometric objects, however 
|Tashiro, 1950 Tashiro, 1952 Yano, 1957 Szybiak, 1963 Szybiak, 1 966 1. Though typi- 



cally one figures out the form of the covariant derivative by hand, there is in fact a gen- 
eral formula for figuring out the covariant derivative, when it is defined — which it isn't 
if the geometric object in question has a transformation rule involving second deriva- 
tives, as connections do. The same holds for the Lie derivative (which is always defined), 
including that for a nonlinear geometric object. Coordinate transformations near the 
identity suffice |Sz ybiak, 1963 Szybiak, 1966| , which explains why the usual ("polar" 



|Jackson, 1975] ) vectors and axial vectors have the same Lie and covariant derivative 
formulas, despite doing different things under time reversal or spatial inflection; by the 
same token, the difference between scalars and pseudoscalars disappears infinitesimally. 
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More generally, objects which impose some restrictions on coordinate transformations 
far from the identity nonetheless have well defined Lie and (where appropriate) covariant 
derivatives. 

For nonlinear geometric objects, a general result is that the Lie and covariant 
derivatives themselves are not geometric objects, but the Lie (or covariant) derivative 
forms a geometric object with the object itself |Tashiro, 1950 Tashiro, 1952, Yano, 1957 



Szybiak, 1963 Szybiak, 1966] . If \ is a nonlinear geometric object, then only the pair 



(x, C^x) is a geometric object — that is, has a transformation rule giving the components 
in one coordinate system in terms of the components in the other coordinate system — 
not C^x alone. The reason is that the Lie derivative in one coordinate system depends 
on both the Lie derivative and the original field in another coordinate system. The same 
holds for covariant derivatives (unless special circumstances make noxious terms vanish, 
as will occur in part for OP spinors): (x, Vx) is a geometric object, but Vx alone is not 
Yano, 1957 Szybiak, 1963 Szybiak, 1966| . An exceptional case is when Vx vanishes, 



for example. A relevant partial instance is the spinorial case when one can write x as 
(g^v, t/>), Vg vanishes, and the transformation rule for ip is linear in ip (though dependent 
on g), as will appear below. The fact that an Ogievetsky-Polubarinov spinor forms (up to 
a sign) part of a geometric object with (the conformal part of) the metric overcomes Car- 
tan's "insurmountable" |Cartan,T9 66| difficulties in covariantly differentiating of spinors 
using the coordinate techniques of classical differential geometry. 

2.8 Differentiating OP Spinors as (Almost Part of) Nonlinear 
Geometric Objects 

One often reads that spinors do not admit Lie differentiation, unless the gen- 



erating vector field is a conformal Killing vector field Benn and Tucker, 1987 



Penrose and Rindler, 1986, p. 101]. Unlike other spinors, OP spinors do in fact have 
a classical Lie derivative, as one would expect from their having a spinorial coordinate 
transformation law and no additional gauge group. Because the spinor forms only part of 
a geometric object (or something close enough), with the remainder being the symmetric 
square root of the metric r^ u or the like, one in fact must consider a pair such as (r^ u , ip) 
as a candidate for Lie differentiation. Thus (r^ u ,ip, C^r^, C^ip) is a geometric object 
(apart from coordinate restrictions and spinorial double- valuedness) . Thus one obtains 
the desirable property that the commutator of two Lie derivatives is the Lie derivative 
with respect to the commutator of the generating vector fields. 

Here two simplifications are possible by making use of the theory of nonlinear ge- 
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ometric objects, which Ogievetsky and Polubarinov do not. First, r^ v and are 
equivalent as geometric objects (at least for admissible coordinate systems for r^ u )— 
that is, each is an algebraic function of the other, and that function is the same in all 
coordinate systems. One can therefore use the pair (g^u,^) instead of (r^,^); while 
the spinor transformation law still depends on the metric and in a nonlinear way, at 
least the nonlinearity of the transformation rule for r M „ is averted. The fact remains 
that (g^u, ip) has a nonlinear transformation law because the spinor ip has a nonlin- 
ear metric-dependent transformation law, given in detail (at least infinitesimally) by 



OP Ogievetskii and Polubarinov, 1965 . Thus a geometric object that contains the 
Lie derivative of the spinor field is (g^^ip, C^g^y, C^ip) . In fact ip is redundant due 
to the linearity of transformation rule in if), so the relevant geometric object is merely 
(g^v, C^g^u, ^ ne transformation rule for Cgij) depends on g^ v without depend- 

ing on C^g^ u , so one finally has the geometric object {g^, C^ip) for the Lie derivative of 
the spinor. The second simplification will appear below. 

Note that just as the covariant derivative of the metric with respect to the metric- 
compatible connection vanishes, the symmetric square root of the metric inherits the 
same property: Vg^ u = -B- Vr^ = 0. One can see that result by taking 

by the Leibniz rule, recalling that V a r) pa = because r] pcr is a constant matrix, and notic- 
ing that the remaining terms give 40 equations related to 40 unknowns by a nonsingular 
linear transformation. A perhaps even more straightforward derivation that Vr Ml/ = 
uses the series expansion in equation (j3J) for r MP in terms of g^ u and r] pcr and notes that 
every factor in every term on the right side has vanishing covariant derivative. Such a 
derivation applies, at least, whenever the series expansion holds. But for any manifold, 
there is around any point a neighborhood where the series expansion converges. Even for 
nonlinear geometric objects, the vanishing of a covariant derivative is an invariant notion 
|Szybiak, 19 63 1. So this derivation by the series expansion works in general. 



3 Conformal Invariance of Dirac Operator with 
Densities: No Volume Element if Massless 

A second simplification is possible, one not suggested by OP originally (but 
Ogievetsky, 1973 Borisov and Ogievetskii, 1974| ), but suggested by the con- 



sec 



formal covariance of the Dirac equation Infeld and Schild, 1946 Hitchen, 1974 
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Choquet-Bruhat and DeWitt-Morette, 1989[ |Bourguignon, 1993| |Branson, 2005|. It is 



not often noted (except in Schouten and Haantjes, 1936, Haantjes, 1941 Peres, 1963] in 
four dimensions), but is quite true, that suitable use of tensor densities and densitized 
spinors can make the Dirac equation (absent a mass term) and its Lagrangian den- 
sity manifestly conformally invariant. (It is mysterious that the MathSciNet review of 
([Ha antjes, 1941| ) fails to notice the novelty of Haantjes's excluding the volume element 
using weighted spinors.) The right choice of variables makes conformal transformations, 
which alter the volume element, do nothing at all to the variables actually used; the 
volume element disappears. The OP coordinate transformation rule for spinors does 
not depend on the volume element y/—g, but only on the conformal part of the metric, 
g^u, a tensor density. One can split the metric into irreducible pieces, those being the 
conformal part and the volume element Peres, 1963 Anderson, 1967 Katanaev, 2 005 1 , 
using the relation g^ v = g^uy/^g 3 ; one sees that \g^ u \ — — 1- This decomposition recently 
proved crucial in finding a counterexample to the Anderson-Friedman absolute objects 
program, in that GR itself turns out to have an absolute object in the volume element 
Pitts, 2006], |Giulini, 2007|. One can make a similar decomposition of the symmetric 



square root of the metric, or of its inverse r^ v . To avoid the proliferation of symbols, 
I will express the determinant of the square root of the metric in terms of g; for the 
conformal part with anti-unit determinant, I again award a hat on top. A relevant result 



is r 



fJ,U 



-g 



While neither the elimination of the tetrad (saving 6 components) 



Ogievetskii and Polubarinov, 1965 nor the elimination of the volume element 



(saving 1 component) |Schouten and Haantjes, 1936 |Haantjes, 1941[ |Peres, 1963| 
is very new, it is surprising how little known either procedure is. Still more rare, 
and probably novel, is the combination of the two. The closest work that comes to 



mind |Hamamoto, 1978 Ne'eman and Sijaci, 1979, Ivanenko and Sardanashvily, 1983 



Ne'eman and Sijaci, 1988, Boulanger and Kirsch, 2006| exponentiates a symmetric 
traceless entity, which many of the authors call "shear" by obvious analogy to 3- 
dimensional continuum mechanics, and get something like the square root of the 
conformal part of the metric. But getting that entity leaves unresolved whether it can 
be used, without \J—g in the picture, to write the massless Dirac equation. 

One payoff of using only the symmetric square root of the conformal metric density, 
rather than a tetrad, is that it is clear by inspection that the symmetry group in (confor- 
mally) flat space-time is the 15-parameter conformal group. With the tetrad legs present 
and supposedly needed, one might be reluctant to trust the local Lorentz invariance to 
conclude rigorously that only the metric, not the tetrad, must be considered. With the 
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volume element present, even with a transformation rule under conformal rescalings, it 
again takes at least some thought to realize that the volume element does not affect the 
symmetry group. But with the surplus structures eliminated in favor of r^, one needs 
only to use the fact that C^r^ = is equivalent to C^g^ u = and the fact that the latter is 
the conformal Killing vector equation (with the surplus volume element purged) , to infer 
that the 15-parameter conformal group is the group of symmetries of the non- variational 
object in the Lagrangian density in the conformally flat case. 

To avoid surplus structure, one wants spinors, the square root of the metric, and 
densities together. The Dirac operator 7 A e^V^ that acts on spinors is conformally co- 
variant |Choquet-Bruhat and DeWitt-Morette, 1989 Branson, 20 05 1. One can show us- 



ing densities (including densitized spinors) that there is a conformally invariant Dirac 
operator lurking in the vicinity; the appropriate spinor turns out to have weight | 
in four space-time dimensions or, more generally, in n space-time dimensions. 
That conformally invariant Dirac operator is j^f^V ' u ipw, where 7^ denotes a set of 
numerical Dirac matrices, is the symmetric square root of the inverse conformal 
metric density g^, V„ is the Ogievetsky-Polubarinov covariant derivatives for spinors 



Ogievetskii and Polubarinov, 1965 with the density weight term (with the weight al- 



tered to match the usual western rather than Russian conventions), and ip w is a spinor 
with weight w = No use is made of any scalar density in defining this operator, so it 
is a concomitant of just the weighted spinor and the conformal metric density. It turns 
out that the Lagrangian density in this formalism with the densitized variables is also 
manifestly conformally invariant in any dimension, because 

C = V^jflrfv^Vrf = Vw^V M Vv (21) 

yj—g does not appear even algebraically in the latter expression. (Analogous re- 
sults hold for the conformally covariant coupling of scalar fields (c.f. |Wald, 1984 



Choquet-Bruhat and DeWitt-Morette, 1989] ): using scalar (density) fields with weight 



\/—g does not appear in the Lagrangian density.) It is noteworthy that for 



2n 

(co)vector fields, taking the electromagnetic vector potential to be a weight covec- 
tor also achieves conformal invariance, at least in four dimensions. One could, if one 
wished, raise the index and/or densitize the vector potential and spoil that invariance 
if one wished, leaving mere conformal covariance; but no one does so because the most 
naive case is also the nicest. Much of the apparatus of conformal rescaling of fields 
|Wald, 1984[ |Choquet-Bruhat and DeWitt-Morette, 1989] can be avoided by the use of 



suitably weighted densities, which automatically depend on the metric in the appropri- 
ate way (or so one assumes initially, before taking the densities as primitive) and do not 



44 



change under conformal rescalings. 



4 Differentiating OP Spinors with Conformal Invari- 
ance 

Ogievetsky and Polubarinov show that the Lie derivative for a spinor under coordinate 
transformations contains a new metric-dependent term, linear in the gradient of the 
generating vector field and highly nonlinear in r^ v . If one introduces the decomposition 
r tiv _ f.nv^j_g -j j n ^ their equations 22 and 23 (of which the latter ought to have a 
factor of ip at the end), one can show that a change of integration variable from a to 
cty/—g ~5 makes \/—g disappear from their expression A and hence from the infinitesimal 
coordinate transformation formula and Lie derivative altogether. This is not surprising: 
besides the conformal covariance, which we saw to admit reformulation to achieve con- 
formal invariance, one also notices that the introduction of a local 0(1, 3) group is served 
as effectively with a densitized tetrad e\ of weight —4; as a matrix, this tetrad has 15 
independent components, the determinant being 1 and the volume element having can- 
celled out. The local 0(1,3) (local Lorentz) index A, not the coordinate transformation 
properties, is doing the work here, so altering the coordinate transformation properties by 
a choice of tensor density weight makes no difference. Spinors, at least in the tetrad and 
OP formalisms, are deeply tied to the conformal part of the metric, but not closely tied to 
the volume element. Thus instead of an arbitrary nonsingular asymmetric 16-component 
4x4 matrix of tetrad components, only a symmetric 9-component (anti-)unimodular ma- 
trix of components of the square root of the conformal metric density is required for the 
Dirac equation (apart from a mass term, which is trivial in its spinorial transformation 
properties). 

Thus to form a geometric object (up to a sign and with time listed as the first 
coordinate, as usual) involving an OP spinor ip, one needs not the full metric tensor 
g^ u , but only the conformal metric density g^ u , which of course is equivalent to f^ u . 
(One could view these as alternate coordinatizations of the same fiber.) Lack of con- 
formal mvariance has led several authors to the mistaken conclusion that merely con- 
formal Killing vectors are less special than are Killing vectors, as far as the massless 
Dirac equation is concerned |Kosmann, 1966a" Kosmann, 1966b Fatibene et al., 1998 



Cotaescu, 2000 Fatibene and Francaviglia, 2003 . But that cannot be the case, because 



one cannot need the cooperation of an absent volume element any more than one can 
need the cooperation of the aether in Special Relativity. 
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The entity (g^ u ,ip) still has a nonlinear transformation rule for ip, but gratuitous 
nonlinearity and the irrelevant volume element have been pared away. Using the 
transformation rules for Lie and covariant derivatives for nonlinear geometric objects 
C^Xi Vx, for x |Szybiak, 1963[ |Szybiak, 1966| , one has, schematically, for the com- 



ponents in the primed coordinate system in terms of those in the unprimed system, 
{C^ip)' ~ {^jC^ip + -Tj^—C^g^ . One notices that C^g^ u = is the equation for ^ to be 
a conformal Killing vector field. (Nowadays one usually sees the conformal Killing equa- 
tion written as C^g^ v ~ g^ iu , thus introducing gratuitous reference to a volume element; 
the shoe of surplus structure again is on the other foot, that is, the modern foot, not the 
classical foot ,^|) One sees that the Lie derivative with respect to conformal Killing vector 
is nicer, because the second term ^-C^g^ u then disappears. But the Lie derivative exists 
for any vector field, not just conformal Killing vectors relative to the metric. It is evident 
that failure to include the (conformal part of the) metric with the spinor will lead to 
problems suggesting that spinors have Lie derivatives only for conformal Killing vector 
fields, just as one often hears |Penrose and Rindler, 1986 , Benn and Tucker, 1987 . One 



thus explains a standard difficulty in spinor formalisms and overcomes it using the OP 
formalism. 

For covariant derivatives, the situation is similar, but not identical: 



Using the metric compatibility of the covariant derivative (with the appropriate extra 
term for a density of weight — | |Schouten, 1954[ |Anderson, 1967[ |Israel, 1979| ) 



the unwanted new metric term disappears. It therefore follows that Vip can be a spinor- 
covector, as OP announce that it will be. (g^, V^) is a geometric object, ip itself does 
not appear due to the linearity in ip of the transformation rule for ip, while Vg M „ doesn't 
appear because it vanishes. One could not impose that V^> be a spinor-covector without 
disposing of the new metric term — which would not be possible if an arbitrary connection 
were employed to take a covariant derivative. 



5 What is the Lie derivative of an equivalence class? This formula gives an answer in a specific case. 
Siwek gives a general answer to this question |Siwek, 1965a , albeit one that he misapplies in the case 



of the object of Pensov |Siwek, 1965b| by neglecting the coordinate dependence of a certain scale in 
defining an equivalence relation. But why take the Lie derivative of an equivalence class when the 
geometric object g„ v is available? 
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5 Spinors, General Covariance, and Anderson- 
Friedman Absolute Objects 

An adequate account of general covariance must take spinor fields into account, because 
clearly there are electrons (or at any rate lumps of excitation of the electron field), and 
clearly in some (perhaps less fundamental) sense there are protons and neutrons as well, 
to say nothing of the less famous cousins posited by current physical theories and in 
many cases discovered in particle accelerators and the like. Whereas Friedman's account 



of general covariance and absolute objects does not include spinors Friedman, 1983 
Anderson's book does include spinors. Just how his treatment of spinors relates to his 
treatment of absolute objects is not terribly clear, however. Although Anderson expected 
a correlation between whether a field counts as absolute or not (in terms of his criterion 
of gauge equivalence in all models) and whether it is varied in the least action principle 
to get Euler-Lagrange field equations, it is not clear what variational principle would 
apply. Perhaps Heller and Bergmann's treatment Heller and Bergmann, 1951] , which 



Anderson did not mention but probably knew, would work, though its extra Lagrange 
multipliers are unappealing and savor of the irrelevant fields that Anderson's formalism 
bans | Anderson, 1967 Thorne et al., 1973 Lee et al., 1974 . More promising is the rather 



standard orthonormal tetrad formalism. 

One major problem with the orthonormal tetrad formalism in the context of An- 
derson's absolute objects analysis of general covariance is that the tetrad includes a 
geometric object that is the same (up to gauge equivalence) in all models, an absolute 
object |Pitts, 2006] , which is just the sort of thing that substantively generally covariant 
theories are not supposed to have, according to Anderson. Some time ago the Jones- 
Geroch dust counterexample was proposed as a counterexample to Anderson's analysis 
Jones, 1981b, Jones, 1981a, Friedman, 1983] . The 4- velocity of dust is a tangent vec- 



tor that is nonzero wherever there is dust. This proposed counterexample is incorrect 
|Pitts, 2006] , because an absolute object must be the same in all models (diffeomorphi- 
cally equivalent between arbitrary neighborhoods), and dust has models with holes, yield- 
ing neighborhoods full of dust and neighborhoods that are empty. But the mathematical 
fact that nonvanishing tangent vector fields are all alike up to diffeomorphisms remains. 
(In fact there is a broader class of such objects, including tangent vector densities of any 
weight except 1, first analyzed by Andrzej Zajtz |Zajtz, 1988| and later independently 



rediscovered by Robert Geroch in discussion with the author Pitts, 2008 1.) Whereas 
there possibly being holes in the dust makes the the Jones-Geroch dust example go away, 
the time-like leg of an orthonormal tetrad cannot ever vanish anywhere in any model. It 
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therefore is just the sort of counterexample that the Jones-Geroch dust 4-velocity field 
was believed to be. If a tetrad is necessary for coupling spinor fields to a curved metric, 
then the existence of electrons generates a counterexample to Anderson's analysis of gen- 
eral covariance: GR + electrons has an absolute object and hence violates Anderson's 
conception of substantive general covariance. A tetrad leg, given coordinate and gauge 
freedom, can have components (1, 0, 0, 0) in some neighborhood about any point. 

The solution to this problem, as observed previously |Pitts, 2006 , is to note that 
an orthonormal tetrad is not necessary for coupling spinor fields to a curved metric, 
because the OP spinor formalism lets spinors see the (square root of the) metric directly, 
without any orthonormal tetrad in the theory. Six components of the 16 in the tetrad 
are irrelevant (leaving aside the volume element). In the OP spinor formalism, these 
irrelevant bits are removed. Thus there is no tangent vector anywhere in the theory, once 
the surplus structure is removed. 

As it happens, Geroch and independently Giulini have observed that the volume ele- 
ment y/—g in GR serves as a counterexample to Anderson's analysis of general covariance 
|Pitts, 2006 Giulini, 2007 . Thus the viability of Anderson's analysis seems not to turn 
ultimately on a correct treatment of spinor fields. Evidently a correct analysis of substan- 
tive general covariance has not been found, or at any rate Anderson's rather attractive 
and widely known analysis is not it. 

But in fact the mild coordinate restrictions required by OP spinors implies that even 
the formal or trivial sort of general covariance, supposedly involving the admissibility of 
arbitrary coordinates, is problematic, contrary to widespread belief. If my expectation 
that no one would bother to introduce a metric tensor in SR in order to admit swapping 
the coordinates t and x is accepted, then it is quite unclear why one should introduce 
a tetrad instead of just a metric — 6 irrelevant field components — in order to remove the 
OP restriction that time be listed first. 



6 OP Spinors, Conservation Laws, and Gravitational 
Energy Localization 

It has been generally thought necessary to treat spinorial energy-momentum in a 
very different fashion from the energy-momentum of fields that are tensors (or ten- 
sor densities, connections, or the like), on account of the absence of a Lie deriva- 



tive for spinor fields for an arbitrary vector field M0ller, 1961, Fatibene et al., 1998 



Fatibene and Francaviglia, 2003| . But now that a Lie derivative for spinor fields is known, 
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it follows that the bulk of the usual treatment of conserved quantities in terms of Lie 
differentiation also applies to spinor fields |Bilyalov~ 992, P itts, 201~0] . There are a few 
technical complications arising from the nonlinearity of the (almost) geometric object 
(g^u, but these are manageable and appropriate reflections of the character of spinors 
under coordinate transformations. It was shown recently that the usual objections to 
the localization of gravitational energy in terms of lack of a transformation law — i.e., 



failure to be a geometric object — do not withstand scrutiny Pitts, 2010 . The desire for 
such a transformation law can be justified only on the assumption that the conserved 
quantities in different coordinate systems are equivalent, because that is just what trans- 
formation laws imply. "Physical meaning" neither entails nor is entailed by a trans- 
formation law. But such equivalence is an arbitrary assumption. There are infinitely 
inequivalent symmetries |Bergmann, 1 958 1; why should one expect inequivalent symme- 
tries to give equivalent conserved quantities? Why should one be bothered, for example, 
that the gravitational potential energy, involving the square of the coordinate gradient 
of the metric components, is highly nontrivial for spherical coordinates in Minkowski 
space-time (Bauer's objection |Bauer, 1918[ |Pauli, 19 21 1 )? Though 'nothing happens' 
regarding the gravitational field, a great deal happens with the coordinates, making 
the relation between the two vary strongly with position; the Noether treatment entails 
that the pseudotensor involves a relation between the geometry and the coordinates. 
One can overcome the relation to a coordinate system by admitting all coordinate sys- 
tems, obtaining a pseudotensor in all coordinate systems, an infinite-component entity 
|Pitts, 2010| . Such an entity is, in a technical sense of classical differential geometry, an 
"object" |Nijenhuis, 1952 , p. 28] (in the sense of having a set of components at every 
point in every coordinate system about that point), though not a geometric object, due 
to its lack of a transformation law to make the components in different coordinate sys- 
tems equivalent. Christian M0ller's fourth criterion, that the gravitational energy density 
be a scalar density for purely spatial coordinate transformations in order to make the 
energy in the laboratory room invariant |M0ller, 1961 , appears to be motivated only by 
wishful thinking or an analogy defeated by Noether's theorem. Hence a treatment in 
terms of Noether's theorems and Lie differentiation faces no significant objections. With 
OP spinors admitting a Lie derivative, spinor fields can be included without conceptual 
difficulty. Without OP spinors, one could not give such a unified treatment of energy- 
momentum localization and conservation in GR involving both bosonic and fermionic 
fields. 
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7 Spinors and the Partial Conventionality of Simul- 
taneity 

The conventionality of simultaneity has been a longstanding issue in the philos- 
ophy of physics |Reichenbach, 1928 , Griinbaum, 1973] . The question has arisen 



whether spinor fields pose distinctive issues for the conventionality of simultaneity 



Zangari, 1994 Gunn and Vetharianam, 1995 , Karakostas, 1997 Anderson et al., 1998 



Bain, 2000 Jammer, 20 06 . Zangari claimed that spinors require standard simultaneity 
and hence refute the thesis of the conventionality of simultaneity |Zangari, 1994] . (Here 
I do not have in mind questions of definability |Malament, 1977| , but only the logi- 
cally prior question of the range of coordinate freedom pertaining to simultaneity choices 
|Havas, 1987 | .) 

Such a result would be a conceptual triumph if its technical foundation were correct, 
but Zangari's conclusions were refuted by Gunn and Vetharianam, who noted that Zan- 
gari had chosen an inappropriate representation of space-time coordinates and that an ap- 
propriate representation permits nonstandard synchrony conventions. Gunn and Vethar- 
ianam employ a purpose-built generalization of the Dirac equation from the usual special 
relativistic form, without relating it to any standard formalism (though one can show 
that it has such relations). Karakostas, while disagreeing on many technical points with 
Zangari and providing a more adequate technical basis, defends an anti-conventionalist 



thesis akin to Zangari's Karakostas, 1997 . Part of Karakostas's work involves contact 
with the orthonormal tetrad formalism, according to which spinors transform as coor- 
dinate scalars — that is, do not transform at all — and are spinorial with respect to an 
unrelated group that double-covers the local 0(1,3) group under which the local Lorentz 
indices (which I have generally written with capital Latin indices) transform. One might 
expect the result of such a formalism, pace Karakostas, to be that spinors as such are 
irrelevant to the conventionality of simultaneity because spinors as such are coordinate 
scalars and hence do not even notice the choice of synchrony convention. 



Bain takes a nuanced position that leaves room for conventionalism Bain, 2000 . In 
particular, he refutes some of Karakostas's criticisms of Gunn and Vetharianam, but takes 
issue with their move that arguably trivializes the issue of spinors and conventionality by 
their breaking the link not simply between spinors and coordinates, but between spinors 
and space-time transformations of whatever sort. From a mathematical standpoint, Gunn 
and Vetharianam's treating spinors as coordinate scalars — and presumably as spinors 
with respect to an unrelated 0(1,3) group — is unexceptionable. (One readily sees that 
their formalism is a special case of the orthonormal tetrad formalism where one ignores 
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the locality of the 0(1,3) group and refers only implicitly to a global 0(1,3) subgroup 
to identify ip as a spinor. In standard coordinates their tetrad is simply the Kronecker S, 
while their modified 7 matrices absorb the transformed tetrad. Thus they pedagogically 
ignore all irrelevant complication to focus on the issue at hand.) Bain's hesitation about 
their separating spinorhood from space-time stems from surprise at spin's being a key 
spacetime-related feature of elementary particles in Special Relativity and yet unrelated 
to spacetime (similar to the internal gauge symmetry of Yang-Mills fields) in General 
Relativity. Such a view of surprise was expressed long ago by Cartan himself: 

Certain physicists regard spinors as entities which are, in a sense, unaffected 
by the rotations which classical geometric entities (vectors etc.) can undergo, 
and of which the components in a given reference system are susceptible 
of undergoing linear transformations which are in a sense autonomous. See 
for example L. Infeld and B. L. van der Waerden .... It is clear that this 
impossibility [of having finite-component spinors with a covariant derivative 
in the classical Riemannian sense] provides an explanation of the point of 
view of L. Infeld and van der Waerden . . . , which is however geometrically 
and even physically so startling. Cartan, 1966 pp. 150, 151] 



Detaching spinorhood from spatial coordinate rotations and the like was, to Cartan long 
ago as to Bain, quite a surprise-but an appropriate response to the impossibility of 
including spinors as such within the realm of coordinate transformations and covariant 
derivatives with Christoffel symbols, which Weyl and Cartan supposedly showed. It is 
to Bain's credit that, at so late a date, he could still recover the surprise expressed by 
Cartan when the (supposed) impossibility was more novel. 

Due to the work of Ogievetsky and Polubarinov, that impossibility is overcome. Car- 
tan's and Bain's preference for unification, which they thought unsatisfiable, is adequately 
realized using OP spinors, for which the group that makes ip a spinor is the (double cover 
of) a Lorentz group that is a subgroup of the space-time coordinate transformations. OP 
spinors, unlike the spinors of the tetrad formalism, are thus a strict generalization of the 
spinors in Minkowski space-time in Cartesian coordinates, such as one finds in standard 
particle physics books |Kaku, 1993 Peskin and Schroeder, 1995 1. While the answers re- 



garding spinors and the conventionality of simultaneity have varied considerably, and all 
the previous answers have flaws, the question is a very welcome one in the sense that it 
provides a rare instance of consideration of the relevance of electrons, protons, and the 
like to philosophical questions about space-time theory. 

OP spinors are friendly to conventionalism about simultaneity, but not for reasons 
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that have appeared previously. Anti-conventionalists Zangari and Karakostas have held 
that spinors as spinors ought to be compatible with nonstandard simultaneity if a con- 
ventionalist thesis is to be upheld, and have denied that spinors as such are compatible 
with nonstandard simultaneity. Conventionalists Gunn and Vetharianam have thought 
it sufficient that spinors be present in a formalism admitting arbitrary coordinates, or at 
least coordinates arbitrary enough to admit nonstandard simultaneity conventions of the 
sort usually employed in this context. That spinors as such are coordinate scalars and 
hence wholly indifferent to simultaneity conventions is good enough for conventionalism, 
according to Gunn and Vetharianam. To Bain, such a move trivializes the anticonven- 
tionalist point. But in the OP formalism, which generalizes special relativistic theory as 
economically as possible (without the surplus structure of a tetrad and a new gauge group 
to deprive the extra fields of physical meaning) , spinors do transform spinorially (not triv- 
ially) under coordinate transformations and do admit nonstandard simultaneity choices, 
as will appear below. The anti-conventionalist's point is not trivialized, as Bain worried, 
but met head-on with unfamiliar technical results. A great variety of time coordinates 
is permitted, including all of them usually considered in the debate over the convention- 
ality of simultaneity debate and some that are not. That is not, however, because any 
coordinate whatsoever (nor even any 'flat' one linearly related to standard coordinates 
in Minkowski space-time) is admissible as time; in fact some coordinates are inadmissi- 
ble as time coordinates in the OP spinor formalism. The conventional Kretschmannian 
wisdom is false (at least unless one inflates the formalism with 6 extra gauge compen- 
sation fields in the form of a tetrad). The dividing line is apparently unprecedented 
in other formalisms, not corresponding, for example, to the Hilbert-M0ller proposal to 
make the time and space coordinates bear obvious temporal and spatial relationships to 
the null cone |Hilbert, 2007 Pauli, 1921 M0ller, 1952| . Instead of the nonlinear metric- 
independent transformation rule typical of OP spinors, for standard simultaneity a linear 
metric-independent transformation rule arises, because transformations between stan- 
dard coordinate systems lie within the 15-parameter conformal group for which linearity 
was proven above. 

Let the transformation from standard to nonstandard simultaneity coordinates be 
given by 

= (x° + (2ei - l)x x + (2e 2 - l)x 2 + (2e 3 - l)x 3 , x\ x 2 , x 3 ). 

One sees that standard simultaneity is the case e\ = e 2 = e 3 = |. It is convenient to 
define 

n = (2e 1 -l,2e 2 -l,2e 3 -l), 
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so then 



x 



Applying the tensor transformation rule to a rank 2 contravariant tensor, one finds the 
inverse metric for Minkowski space-time in with nonstandard simultaneity to be 
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(22) 



These components have determinant —1, so there is no difference between the metric and 
its conformal part in these coordinates. Let us calculate the symmetric square root of 
the metric r ,fll/ , which satisfies 

r'^rj NA r'^ = cp". (23) 
One way to do the calculation is to use the binomial series expansion 



Ogievetskii and Polubarinov, 1965 . Another way, which I will employ here, is to use 
a generalized eigenvalue formalism. It is convenient to ignore two spatial dimensions and 
replace n with one number n; there is no loss of generality, because one could rotate any 
n to be along the x-axis. The generalized eigenvalue problem can be written as 



g'^ - Xr] 
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One readily finds that 
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(24) 
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(25) 



Returning to the specific form of the eigenvalues, one sees that near the origin the 
eigenvalues tend to be complex, but farther from the origin (|n| > 2), the eigenvalues are 
real and negative. It follows that coordinates with |n| > 2 are inadmissible. Coordinates 
with |n| < 2 are permitted. The final result is 
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Recalling that n = 2e — 1 and that the usual range of e in discussions of the conventional- 
ity of simultaneity is between and 1, it follows that the usual range of n is between — 1 
and 1. Thus the Ogievetsky-Polubarinov formalism permits — 2 < n < 2, considerably 
larger than the typical — 1 < n < 1 of the conventionality of simultaneity discussion, but 
less than the full Kretschmannian arbitrariness expected based on experience with tensor 
calculus. The table compares the values discussed in the Reichenbachian conventionality 
of simultaneity literature, the values shown here to be admitted in the OP spinor for- 
malism, and the values permitted by the Kretschmannian conventional wisdom that any 
coordinates are admissible. 



Reichenbachian 
Simultaneity Conventionality 


Ogievetsky-Polubarinov 
Spinor Formalism 


Kretschmannian 
Coordinate Arbitrariness 


< e < 1 


4 <e <f 


any e 


-1 < n< 1 


-2 < n< 2 


any n 



Of course if one were prepared to interchange the order of the coordinates, one could 
admit any values for e and n values in the OP formalism. But such a move steps out- 
side the traditional parameters of the conventionality of simultaneity discussion, which 
assumes that one already knows which coordinate is time. 

Recalling that the tetrad formalism involves making a coordinate transformation one 
the world (Greek) index and a Lorentz transformation on the local (Latin) index, one 
can calculate the finite Lorentz transformation needed to symmetrize the tetrad. The 
quantity 



(27) 



is that Lorentz transformation. For the Ogievetsky-Polubarinov formalism, one then 
applies that Lorentz transformation to the spinor field by multiplication by a suitably 
related matrix S. After a few pages one finds that the spinor transformation S is given 
by 



S = I cosh 




(28) 

(This exact expression confirms the result that —2 < n < 2.) In short, a coordinate 
transformation from standard simultaneity to nonstandard simultaneity induces an n- 
dependent boost of the spinor. The dependence on n is inherited from the metric via 
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the symmetric square root of its conformal part. The spinor transformation rule, one 
recalls, is linear in the spinor but nonlinear in the metric. In this simple case the result 
is tractable for explicit calculations. 



8 The Schwarzschild Radius and 'Time' Coordinates 

Part of the lore of general relativists is the role of "Eddington-Finkelstein" coordinates 
in the late 1950s in helping to overcome 'Schwarzschild singularity' at r = 2M; instead 
that radius came to be known as the "horizon" of a black hole, through which one might 



readily enough pass on the way to the curvature singularity at r = Finkelstein, 1958 



Misner et al., 1973| . (Note that I am neither endorsing nor criticizing the details of this 



informal textbook history; the point at hand is philosophical.) Recalling such history, 
one might well conclude that holding too tightly to an association between a coordinate 
and some qualitative temporal or spatial character played a negative role in the context 
of discovery for the significance of r = 2M, a role only overcome with difficulty using 
infalling coordinates that made no such qualitative associations^ Does the OP formalism 
un-learn lesson of Schwarzschild radius by re-regimenting coordinates as temporal and 
spatial? 

The infalling Eddington-Finkelstein coordinates are a radial coordinate r, a null co- 
ordinate V, and two angles 9, <fi. The metric's line element is given by 

ds 2 = - ( 1 - — j dV 2 + 2dVdr + r 2 (d6 2 + sin 2 6d<p 2 ) 



Misner et al., 1973, p. 828]. A null coordinate such as V is, in some rough sense, 
half spatial and half temporal, r seems quite unambiguously spatial. One can use the 
generalized eigenvalue formalism to ascertain the coordinate ranges of admissibility for 
these coordinates. The results are surprising in more than one respect. One can show 
that the OP admissibility range of (V, r, 6, <fi) (noting the order) is for r > ?jf. That 
seems plausible enough — something strange happens somewhere inside the Schwarzschild 
radius, but at least one can get inside it before having to switch coordinates. That fact 
alone indicates that the OP 'time' coordinate restrictions do not unlearn the lessons about 
coordinates from 1958 that contributed to the modern understanding of black holes. At 
any rate one can get inside the horizon, and there is no reason to assume that r > j s 
a real barrier, so why not keep going with some other coordinate system? 



6 I thank Charles Misner for suggesting this sort of question. 



55 



One can also consider the OP admissibility of (r,V,8,<p), with r coming first. This 
is, intuitively, the 'wrong' order, because a spatial coordinate is playing the role of OP 
'time,' while a null (half time, half space) coordinate is playing the role of space. One 
might expect this coordinate system not to be admissible, or to be admissible only in 
some small exotic region, such as inside the horizon. But on doing the calculation, one 
finds these coordinates to be admissible for r > 0! The eigenvalues are complex. As 
noted above, complex eigenvalues cause no trouble because they permit a real square 
root with eigenvalues in the right half of the complex plane. Thus for r > ^ there is 
more than one right order: (V, r, 8, 0) and (r, V, 8, 0) are both admissible. The relation 
between the spinor components in the two systems is presumably quite nontrivial, in 
contrast to the relationship between the metric tensor components. 

In short, there are cases in which more than one right order exists, and cases where an 
intuitively wrong order is admissible and an intuitively more right order is inadmissible. 
What electrons know as 'time' need not be unique or intuitive. Clearly OP spinors keep 
their own counsel in defining what counts as 'time;' electrons don't so much know what a 
time coordinate is (conforming to some externally prescribed standard) as they stipulate 
what a time coordinate is. All of this follows from theorems in linear algebra about 
matrix square roots and negative eigenvalues, not from intuitions imported by hand. 
The lesson of the Schwarzschild radius is not unlearned by OP spinors. 



9 Taking Stock 

No one, I think, will argue that special relativistic field theory for tensor fields is correct 
when one permits arbitrary coordinates by using a metric tensor (under arbitrary trans- 
formations) and wrong when one uses only Cartesian coordinates. Neither will anyone 
argue that admitting arbitrary coordinates is generically more convenient, though it can 
be in specific applications. Neither will anyone argue that admitting arbitrary coordinates 
is conceptually illuminating; certainly Peter Bergmann would not |Bergmann, 1957 . The 



same presumably holds for gauge symmetry in massive electromagnetism. It isn't cor- 
rect to introduce Stueckelberg's gauge compensation field and incorrect to omit it and 
have no gauge freedom. Neither is it generically more convenient to use Stueckelberg's 
trick, though in some applications it is. Neither is it generically more illuminating to 
use it than to omit it. If anything, the introduction of gauge compensation fields sheds 
conceptual darkness, not light, on theories, by tempting one to overlook the difference 
between merely formal gauge symmetries installed by a trick and substantive gauge sym- 
metries characteristic of General Relativity, Maxwell's (massless) electromagnetism, and 
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(massless) Yang-Mills fields. De-Ockhamization, such as setting 'force' equal to the sum 
of 'gorce' + 'morce,' is, at least defeasibly, not the path to conceptual enlightenment 
IGlymour, 1977] IQuine, 1975 . 



Prior to the OP formalism, one did not know that the tetrad formalism was de- 
Ockhamized. But now one has the Ockhamized formulation. One thereby obtains greater 
unity between bosons and fermions within the theory of nonlinear geometric objects (up 
to a sign, with coordinate restrictions), a unified treatment of space-time symmetries 
in terms of Lie derivatives, and hence a unified understanding of conservation laws. 
Especially for conceptual questions, OP spinors are more illuminating than the tetrad 
spinor formalism. The OP formalism also merits a modern bundle treatment for the 
better treatment of global issues, matters not treated here. Such treatment should begin 
with work by Bilyalov. 

Recalling that the Weyl-Cartan theorem about the need for an orthonormal tetrad de- 
pends heavily, for its significance at least, on failure to imagine the OP spinor formalism, 



one can imagine a counterfactual history, a rationally reconstructed one Lakatos, 1971 



one in which OP spinors are invented before Weyl and Cartan circulate their theorem. 
It seems obvious that in such a world, the Weyl-Cartan spinor theorem, when invented, 
would be taken to imply the rather pedestrian conclusion that spinor fields require using 
nonlinear geometric objects (qualified by coordinate restrictions and spinorial double- 
valuedness), not that tensor calculus (broadly construed) is inadequate or that an or- 
thonormal basis is necessary. While an orthonormal tetrad might be introduced at times 
for convenience, yielding a linear representation of a larger group rather than a nonlin- 
ear representation of a smaller group |Gates et al., 1983] , no one would think the tetrad 
spinor formalism worthy of much philosophical contemplation. De-Ockhamization can 
be convenient for technical purposes, but, at least without specific reason to think other- 
wise, it is not conceptually interesting. The fact that introducing a tetrad would permit 
one the freedom to interchange the coordinates and thus obtain complete coordinate 
freedom would be thought far too trivial to justify the cost. It would be a way to fail 
to learn the lesson that the mathematics wants to teach, namely, that the signature 
makes a difference for spinors. Plausibly, a rational process of updating beliefs in light 
of evidence should be independent of the order in which the pieces of evidence arise 
Wagner, "2002| | Jeffrey, 2004| , at least if some technical conditions hold. The case at 



hand is nontrivial in that, like the problem of old evidence, it involves an expansion in 
logical-mathematical awareness, something that an ideal Bayesian agent could never ex- 
perience. Perhaps one can apply some strategies that have been used for the problem of 
old evidence |Garber,T9 83|. There might be an interesting methodological question here. 
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In any case, returning to the spinor example at hand, it seems evident that in a more 
rational world, OP spinors would be the default spinor formalism, especially for foun- 
dational purposes, while a tetrad might be introduced for convenience in some practical 
calculations in cases where it exists. 
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